ABSTRACT


The basic filter-observer equations of Kalman for optimal and 
suboptimal filters are studied using the concepts of Lyapunov functions 
and stability theory. The Second Method of Lyapunov is used to form a 
basis for comparison of the convergence rates of such filters. Lyapunov 
functions are also used to derive constraining relations for the 
elements of the filter gain matrix, leading to design criteria for
suboptimal filters. A derivation of the optimal filter gain based upon the
Lyapunov function of a random variable is given. This derivation shows
that the optimal filter converges most rapidly. A design of a
suboptimal filter for one class of signal models is given based solely upon
stability constraints.
SYMBOLS AND ABBREVIATIONS


an n square matrix unless otherwise specified 


an n vector. Scalars will be specified unless obvious 
from the notation 


independent variable time; it is always a scalar
(similarly for n in discrete time)


determinant of matrix A 

trace of matrix A 

eigenvalue of matrix A 

norm of the argument (see Appendix A) 

magnitude of the complex scalar a 

neighborhood about the point $x_0$ of radius $\delta$,
taken as the usual sphere definition: all x such that
$\|x - x_0\| < \delta$

expected value of the random vector x 


is a member of, belongs to 


greatest lower bound 
I. INTRODUCTION

For many problems in control it is desirable to have some measure
of all states in a given signal process. By their nature, many
processes only allow us to physically measure or observe some of the
states characterizing the process. Moreover, there is some amount of
uncertainty in the measured states which may be characterized by
additive measurement noise. Thus, estimation theory, as a device for
estimating the states of a system in the presence of noisy and
incomplete observations, has emerged as an important part of modern
control theory.

In the early 1960s Kalman [K3,K4] advanced the well-known Wiener
filter theory [ref. D2,P1] by showing how such linear least-squares
filters, which Wiener synthesized using classical frequency domain
theory, may be realized in the time domain. This development
coincided with the development of the digital computer and its use as an
active component in process control systems.

The basic filter equation of Kalman is really a recursive weighted
average technique of the following form, in which $H$ denotes the
observation matrix of (2.2):

    $\hat{X} = \tilde{X} + G\,(Z - H\tilde{X})$    (1)


where $\hat{X}$ is the corrected estimate, $\tilde{X}$ is the projected estimate
following known signal dynamics, $G$ is the filter gain weighting
factor, and $Z$ is the signal observation including noise. $G$ is
generally time varying and its selection constitutes the basic design
and nature of the filter. The difference equation of (1) becomes a
differential equation in continuous time.

The corrected estimate, $\hat{X}$, is usually an n vector of the states
of the signal process, and the observation, $Z$, is an m vector with
$m \le n$.
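The weighted-average character of the basic filter equation can be sketched numerically. The fragment below is only an illustration: it assumes the common update form "corrected = projected + G (Z - H projected)", and the matrices `H` and `G`, the numbers, and the function name `corrected_estimate` are hypothetical values chosen for the example, not quantities from this thesis.

```python
import numpy as np

def corrected_estimate(x_proj, z, G, H):
    """Blend a projected estimate with a noisy observation (assumed update form)."""
    residual = z - H @ x_proj      # part of the observation not predicted by the projection
    return x_proj + G @ residual   # the gain G weights the residual

# Hypothetical case: n = 2 states, m = 1 observation (only the first state is measured).
H = np.array([[1.0, 0.0]])
G = np.array([[0.5], [0.1]])       # illustrative gain, not an optimal one
x_proj = np.array([2.0, 1.0])
z = np.array([3.0])

x_hat = corrected_estimate(x_proj, z, G, H)
```

With these numbers the residual is 1, so the first state moves halfway toward the measurement and the unobserved second state is nudged through the second gain element; this is how the gain serves both objectives listed next.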

The selection of the filter gain has two objectives: 


1. To provide an estimate of unobserved states 
based upon data from the observed states. 


2. To provide for some weighting of the observations 
to allow compensation for the measurement noise. 


Objective 1 is the basic observer problem which has emerged in recent
years (such as the Luenberger observer [L1]) and does not consider
any effects of noisy measurements. Objective 2, as stated, involves
estimation theory and the stochastic properties of the observation
noise as well as the random input signals which drive the system.

Current literature contains many examples of the application of
Kalman filters to various problems. The computational capability
required to implement such filters is generally large, and often a
much simpler implementation works nearly as well. The literature also
indicates that the method generally employed to determine the
parameters of these filter implementations involves large scale
simulation trials.

This thesis studies the properties of the basic filter-observer
equation in both continuous and discrete time using the Second Method
of Lyapunov. Lyapunov theory [ref. K1,H1] was developed for
investigating the stability properties of systems. Its application to the
filter-observer problem has appeared in recent literature (Deyst and
Price [D1]). However, this approach has not been fully exploited in
the literature, and its possibilities are investigated in this thesis.


This study leads to engineering insight into the operation of suboptimal
filters and provides some basis for comparison of the performance of
these filters when applied to the same problem. Some specific results
obtained include constraining relations on the filter gain matrix
which ensure:

1. that the filter is asymptotically stable.

2. that the filter will converge to steady state
at a rate faster than a given bound.


Application of the filter observer equations as discussed here is 
limited to linear systems and it is assumed that the signal dynamics 
are known exactly. A review of stability concepts for linear time- 
varying systems and the Second Method of Lyapunov is included. 

Chapter II reviews basic definitions and stability theorems for
linear systems, based upon the concepts of state-space theory, with
emphasis on forced systems so that these results may be applied readily
to the filter-observer problem.

Chapter III reviews the Second Method of Lyapunov. This chapter
includes some theorems on the determination of Lyapunov functions for
certain classes of systems as well as some theorems on transient
estimation using Lyapunov theory which have not appeared in the literature
previously.

Chapter IV applies stability theory to the filter-observer
homogeneous dynamics and contains many original results including the
formulation of a time invariant Lyapunov function for the filter. The
same Lyapunov function may be used for a large class of filters
applied to the same problem. Particular filters have different time
derivatives of the Lyapunov function. This leads to a method of
comparing convergence rates for various filters applied to the same
problem. By applying the Hadamard-Gerschgorin theorem to provide
sufficient conditions for positive definiteness of a symmetric matrix,
certain constraining relations are derived for the elements of the
filter gain matrix which lead to the design of stable filters.

Chapter V considers the forced filter dynamics. New results 
include the introduction of the concept of a Lyapunov function for a 
random variable. A derivation, using these Lyapunov functions, of 
the optimal filter is given showing that such filters also converge 
most rapidly to the minimum covariance of error. 

Chapter VI applies the results of Chapter IV to the design of a
class of suboptimal filters for the general nth-order signal model
in phase variable form.
II. STABILITY OF LINEAR SYSTEMS


A. INTRODUCTION

A review of stability definitions and theorems is set forth in the
following. Many current references are available for this material
[H1, K1, K2, O1, S1, T1, and others]. Although here we are mainly
interested in linear systems, many of the theorems, as noted, apply
to nonlinear systems. The reader is referred to Appendix A for a
summary of norms and norm properties.


Definition 2.1 Free system: Any system with no input
               forcing functions. Such a system is
               described as $\dot{x} = f(x,t)$.


Definition 2.2 Autonomous system: A free system which
               is also time invariant (i.e., $\dot{x} = f(x)$).


Definition 2.3 Equilibrium state ($x_e$): Any state for
               which $\dot{x} = 0$. For a free system
               $f(x_e,t) = 0$.


Definition 2.4 Boundedness: An equilibrium state $x_e$
               is said to be bounded if, and only if,
               there exists a neighborhood $N(x_e,\delta)$
               such that, if the initial state $x(t_0)$
               lies within N, then $\|x(t) - x_e\| < m < \infty$
               for all $t > t_0$.


Definition 2.5 Stability (in the sense of Lyapunov):
               An equilibrium state $x_e$ is said to be
               stable if, and only if, to any
               neighborhood $N(x_e,\epsilon)$ there corresponds a
               neighborhood $N(x_e,\delta)$ such that, if
               $x(t_0)$ lies in $N(x_e,\delta)$, then $x(t)$ lies
               in $N(x_e,\epsilon)$ for all $t > t_0$.


The difference between the last two definitions is important for
nonlinear systems although the definitions are equivalent for linear
systems. Note that Definition 2.5 requires that the state remain
within a preassigned neighborhood (which may be arbitrarily small) for
all $t > t_0$, whereas the definition of boundedness requires only that the
distance (in state space) from $x_e$ to the trajectory $x(t)$ remain bounded.
Thus, boundedness allows for such things as limit cycles.


Definition 2.6 Asymptotic Stability: An equilibrium state
               is said to be asymptotically stable if
               (1) it is stable;
               (2) $\|x(t) - x_e\|$ approaches zero as t
               approaches infinity for any initial
               condition lying in the neighborhood $N(x_e,\delta)$
               for which it is stable.


The above definitions are for strictly local conditions about the
equilibrium state. If the neighborhood $N(x_e,\delta)$ can be the entire state
space, then the system is said to be globally stable or asymptotically
stable in the large (ASIL)¹ about the equilibrium state $x_e$.

Uniform stability is also a useful concept. It means that the
neighborhood in which the initial state must lie is independent of the
initial time. That is, $\delta$ is not a function of $t_0$.

Definitions 2.1 through 2.6 above also apply to discrete time
systems with the use of $(n, n_0)$ in place of $(t, t_0)$ respectively. In
the sequel, theorems and concepts are developed for both continuous
and discrete time systems. The discrete notation is as follows:


¹ Some authors distinguish between global stability and stability in the
large [H1], but for linear systems they are identical.
The System:

    $x(n+1) = A_D(n)\,x(n) + B_D(n)\,u(n)$    (2.1)

The Observation:

    $y(n) = H_D(n)\,x(n)$    (2.2)

The Fundamental Matrix:

    $\phi(n,n_0) = A_D(n-1)\,A_D(n-2) \cdots A_D(n_0)$    (2.3)

The Solution:

    $x(n) = \phi(n,n_0)\,x(n_0) + \sum_{k=n_0}^{n-1} \phi(n,k+1)\,B_D(k)\,u(k)$    (2.4)

For time invariant systems we have $\phi(n) = A_D^{\,n}$.

Also it is interesting to note the relationship between continuous
sampled systems and their discrete equivalent, for the time invariant
case:

    $A_D = \phi(T), \qquad B_D = \int_0^T \phi(T-\tau)\,B\,d\tau$    (2.5)

where $\phi(t) = e^{At}$ is the continuous transition matrix and $T$ is the
sampling interval.
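The sampled-data relationship (2.5) can be evaluated numerically. The sketch below is one possible approach, not a method taken from this thesis: it assumes the time invariant case with $A_D = e^{AT}$, and it obtains both $A_D$ and $B_D$ from a single exponential of an augmented matrix. The helper names `expm_taylor` and `discretize` and the double-integrator example are hypothetical.

```python
import numpy as np

def expm_taylor(M, terms=25):
    """Matrix exponential via scaling-and-squaring with a truncated Taylor series."""
    norm = np.linalg.norm(M, ord=np.inf)
    squarings = 0
    while norm > 0.5:              # scale M down until the series converges quickly
        norm /= 2.0
        squarings += 1
    scaled = M / (2.0 ** squarings)
    result = np.eye(M.shape[0])
    term = np.eye(M.shape[0])
    for k in range(1, terms):
        term = term @ scaled / k   # accumulates scaled^k / k!
        result = result + term
    for _ in range(squarings):     # undo the scaling: exp(M) = exp(M / 2^s)^(2^s)
        result = result @ result
    return result

def discretize(A, B, T):
    """Return (A_D, B_D) for sampling interval T using exp([[A, B], [0, 0]] T)."""
    n, m = B.shape
    M = np.zeros((n + m, n + m))
    M[:n, :n] = A
    M[:n, n:] = B
    E = expm_taylor(M * T)
    return E[:n, :n], E[:n, n:]    # top-left block is e^{AT}, top-right is the integral

# Hypothetical example: a double integrator sampled every 0.1 time units.
A = np.array([[0.0, 1.0], [0.0, 0.0]])
B = np.array([[0.0], [1.0]])
A_D, B_D = discretize(A, B, T=0.1)
```

For the double integrator the series terminates, so the result can be checked by hand: $A_D$ has the sampling interval in its upper-right corner and $B_D$ contains $T^2/2$ and $T$.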


B. FREE SYSTEMS

It is well known that for linear time invariant systems, if the
eigenvalues of the system matrix A all have negative real parts, then
the system is asymptotically stable. Similarly, a discrete system
is asymptotically stable to the origin if, and only if, the
magnitudes of the eigenvalues of $A_D$ are all less than unity. Such
conditions are not easily established for time varying systems. However,
the following theorems apply for the system $\dot{x}(t) = A(t)\,x(t)$, with
$x_e = 0$, and the equivalent discrete case [ref. S1].
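For the time invariant case the two eigenvalue tests just stated are easy to apply numerically. The following is a minimal sketch; the function names and the example matrices are hypothetical and serve only to illustrate the conditions.

```python
import numpy as np

def is_stable_continuous(A):
    """All eigenvalues of A have negative real parts."""
    return bool(np.all(np.linalg.eigvals(A).real < 0))

def is_stable_discrete(A_D):
    """All eigenvalues of A_D lie strictly inside the unit circle."""
    return bool(np.all(np.abs(np.linalg.eigvals(A_D)) < 1))

# Hypothetical examples.
A = np.array([[0.0, 1.0], [-2.0, -2.0]])        # eigenvalues -1 +/- j
A_D_good = np.array([[0.5, 1.0], [0.0, 0.9]])   # eigenvalues 0.5 and 0.9
A_D_bad = np.array([[1.1, 0.0], [0.0, 0.5]])    # one eigenvalue outside the unit circle

continuous_ok = is_stable_continuous(A)
discrete_ok = is_stable_discrete(A_D_good)
discrete_bad = is_stable_discrete(A_D_bad)
```

No comparably simple test exists for time varying $A(t)$, which is why the theorems that follow are stated in terms of the fundamental matrix.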


Theorem 2.1 The origin of a linear system is bounded
            if, and only if, the norm of the fundamental
            matrix $\phi(t,t_0)$ (or $\phi(n,n_0)$) is bounded.


To prove sufficiency note that

    $\|x(t)\| = \|\phi(t,t_0)\,x(t_0)\| \le \|\phi(t,t_0)\|\,\|x(t_0)\|$

If $\|\phi(t,t_0)\| < K$, then $\|x(t)\| < K\,\|x(t_0)\|$.
By Definition 2.4, with $\delta = \|x(t_0)\|$, it follows that the origin
($x_e = 0$) is bounded. Necessity is shown by considering

    $\|x(t)\| = \|\phi(t,t_0)\,x(t_0)\|$

and noting that if the norm of $\phi(t,t_0)$ is not bounded then some
element of $\phi(t,t_0)$ approaches infinity with time and hence $\|x(t)\|$
also approaches infinity and is unbounded.


Theorem 2.2 Boundedness and stability of the
            origin are equivalent in linear systems.


To prove this, we must show that for any $\epsilon > 0$ it is possible to find
a $\delta$ such that if $x(t_0)$ is in $N(0,\delta)$ then $x(t)$ is in $N(0,\epsilon)$ for all
$t > t_0$. This reduces to showing that if $\|x(t_0)\| < \delta$ then $\|x(t)\| < \epsilon$.
By Definition 2.4, if the origin is bounded we may take $\delta = \epsilon/K$.
Then by Theorem 2.1

    $\|x(t)\| < K\,\|x(t_0)\| < K\,\frac{\epsilon}{K} = \epsilon$

which shows that boundedness implies stability. It is obvious that
stability implies boundedness; hence the two are equivalent.


Theorem 2.3 The origin of a linear system is
            asymptotically stable if, and only if,
            the norm of $\phi(t,t_0)$ is bounded for all
            $t_0 \le t$ and $\phi(t,t_0)$ approaches zero as
            t approaches infinity.

If $\|\phi(t,t_0)\| < K$ then the origin is stable (Theorems 2.1 and 2.2). We
now have to show that $\|x(t)\|$ approaches zero as t approaches infinity.
Now $\|x(t)\| = \|\phi(t,t_0)\,x(t_0)\|$, and if $\phi(t,t_0)$ approaches zero then
$\|x(t)\|$ approaches zero.

Moreover, if $\|x(t)\|$ approaches zero for all $x(t_0)$ in $N(0,\delta)$,
then $\phi(t,t_0)$ approaches zero, since $x(t_0)$ is arbitrary in $N(0,\delta)$.

It should be noted that if a linear system is asymptotically stable,
then it is also ASIL, since the conditions for stability depend only
upon the fundamental matrix and are independent of the initial
conditions of the states.

The foregoing theorems also apply to discrete systems with the
change in notation from $\phi(t,t_0)$ to $\phi(n,n_0)$.


C. FORCED SYSTEMS

Now we are interested in studying forced systems of the form

    $\dot{x}(t) = A(t)\,x(t) + B(t)\,u(t)$    (2.6)

We are particularly interested in the effect of u(t) on the system,
assuming it is at rest at $t_0$. Then the solution of (2.6) is

    $x(t) = \int_{t_0}^{t} \phi(t,\tau)\,B(\tau)\,u(\tau)\,d\tau$    (2.7)
Stability of forced systems is considered with respect to a given
set of inputs U = {u(t)}. Generally, this set is taken to be the
set of bounded inputs.


Definition 2.7 Stability of a forced system.
               A forced system is stable with
               respect to U if, and only if,
               the states are bounded ($\|x(t)\| < K$)
               for all u(t) in U and all $t > t_0$.

Theorem 2.4 A forced system is stable with
            respect to the set of bounded inputs if

    $\int_{t_0}^{t} \|\phi(t,\tau)\,B(\tau)\|\,d\tau < K_1$


To prove sufficiency note that

    $\|x(t)\| \le \int_{t_0}^{t} \|\phi(t,\tau)\,B(\tau)\,u(\tau)\|\,d\tau \le \int_{t_0}^{t} \|\phi(t,\tau)\,B(\tau)\|\,\|u(\tau)\|\,d\tau$

If $\|u(\tau)\| < K_2$ then $\|x(t)\| < K_1 K_2$.

Hence the state is bounded.
A consolidating theorem appears in Timothy and Bona [T1] and
Kalman [K1] with proof given in [K1].


Theorem 2.5 For the system described by equation (2.6), if

            (i) $\|A(t)\| \le K_1$ for all t
            (ii) $0 < K_2 \le \|B(t)\,v\| \le K_3$ for all $\|v\| = 1$

            where v is an arbitrary vector,
            then the following statements are
            equivalent.

            1. Any uniformly bounded input
               $\|u(t)\| \le K_4$ results in a
               bounded output $\|x(t)\| \le K_5 < \infty$
               for all $t \ge t_0$.

            2. $\int_{t_0}^{t} \|\phi(t,\tau)\|\,d\tau \le K_6 < \infty$

            3. The free system is uniformly
               asymptotically stable.
III. THE SECOND METHOD OF LYAPUNOV


A. INTRODUCTION 

The foregoing indicates that stability for linear systems is 
completely determined by the boundedness properties of the fundamental 
matrix and the inputs to the system. This means in essence that the 
complete solution to the system must be obtained before stability can 
be ascertained. Lyapunov proposed a method by which stability of a 
system may be determined without finding the solution to the system 
equations. His method is based upon a generalized energy concept. 
From physics it is known that for a nonconservative system, if the 
total energy is always decreasing, then the system is stable. The 
total energy of such a system is a scalar function of the state of the 
system. Lyapunov has shown that the very existence of scalar functions 
of the state of a system which obey certain properties is sufficient 
to conclude that the system is stable. These functions are called 
Lyapunov functions. It is interesting to note that Lyapunov functions 
for a stable system are not unique and that a sufficient condition for 
Stability is the existence of any such function. 


Definition 3.1 Lyapunov function. Any function, V(x),
               having the following properties is
               called a Lyapunov function.

               (a) V(x) is a continuous scalar function
               of the system's state vector and has
               continuous first partial derivatives.

               (b) V(x) is positive definite.

               (c) $\dot{V}(x) = [\mathrm{grad}\,V(x)]^T \dot{x}$ is negative
               definite.


Definition 3.2 A scalar function, V(x), of a vector
               argument is positive definite if, and only if,
               (a) V(0) = 0
               (b) V(x) > 0 whenever $x \ne 0$

               Positive semi-definite means that equality
               is also allowed in part (b).

               For negative definite functions the
               inequality is reversed.


In two dimensional state space the Lyapunov function is a cup-shaped
surface sitting on the origin. Its continuity and definiteness
mean that the level curves of V projected on the state plane are
closed about the origin, and the curve $k_1 = V(x)$ is enclosed within
the curve $k_2 = V(x)$ if $k_1 < k_2$.

Under the foregoing conditions we can consider that the Lyapunov
function gives a measure of distance² from the origin in state space
as the system follows the trajectory x(t).


B. THE BASIC STABILITY THEOREM

Before stating the theorem, consider the effect of explicit time
variation upon the properties of the Lyapunov function. Suppose we
have a function V(x,t) and its time derivative

    $\dot{V}(x,t) = \frac{\partial V}{\partial t} + [\mathrm{grad}\,V]^T \dot{x}$    (3.1)

Now, V(x,t) will be positive for all t if it is always greater
than a positive function $\alpha(\|x\|)$. However, this is not sufficient
to guarantee asymptotic stability even if $\dot{V} < 0$ for all t and $x \ne 0$.


² Distance used in this sense is not the same as the usual
Euclidean distance. Distance is implied by the ordered nature of the
level curves of V as discussed.
Since the grad V term in equation (3.1) indicates the motion of the
system state (i.e., it is multiplied by $\dot{x}$), it is possible that
$\partial V/\partial t$ be sufficiently negative so that $\dot{V} < 0$ for all t, while the state
moves outside of any bounding region $N(x_e,\epsilon)$. Hence, the state may
never go to the equilibrium state $x_e$. Hsu and Meyer [H1] have a good
discussion and Kalman and Bertram [K1] provide a rigorous proof of
the following theorem.


Theorem 3.1 Consider the continuous time, free
            dynamics system $\dot{x} = f(x,t)$ where f(0,t) = 0
            for all t. Suppose there exists a scalar
            function V(x,t) with continuous first
            partial derivatives such that V(0,t) = 0 and

            (i) there exists a continuous,
            nondecreasing scalar function $\alpha$ such that
            $\alpha(0) = 0$ and $0 < \alpha(\|x\|) \le V(x,t)$ for all t
            and $x \ne 0$.

            (ii) there exists a continuous positive
            scalar function $\gamma$ such that $\gamma(0) = 0$ and
            $\dot{V}(x,t) \le -\gamma(\|x\|)$.

            (iii) $V(x,t) \le \beta(\|x\|)$ for all t,
            where $\beta$ is a continuous nondecreasing
            function such that $\beta(0) = 0$.

            (iv) $\alpha(\|x\|) \to \infty$ with $\|x\| \to \infty$.

Then the system is asymptotically stable in the large to the
equilibrium state $x_e = 0$ and V(x,t) is a Lyapunov function for the
system.
Requirement (i) ensures that the Lyapunov function is always
positive, whereas (iii) ensures that the function does not stay infinitely
large throughout the state space. In particular it is required that the
V function does, in fact, go to zero uniformly with time at the origin.
The $\gamma$ function in (ii) assures that $\dot{V}$ is always negative. This
together with (i) and (iii) assures that the Lyapunov function following
the system trajectory is decreasing and must end up at the origin.
To ensure that the conditions exist for all of state space,
requirement (iv) is imposed and the system is ASIL. It should be noted that
a positive definite quadratic form for V automatically satisfies the
requirements of (i), (iii), and (iv).

For discrete systems we define a Lyapunov function as V(x,n)
and the rate of increase of V along the system trajectory as

    $\Delta V(x,n) = V(x(n+1),n+1) - V(x(n),n)$    (3.2)

With these changes, Theorem 3.1 becomes the corresponding theorem
for discrete systems.


C. THE LINEAR AUTONOMOUS SYSTEM 

For linear autonomous systems, it is a necessary and sufficient
condition for ASIL that a quadratic form Lyapunov function exist.
This is embodied in a theorem originally proved by Lyapunov.


Theorem 3.2: The origin of a linear autonomous
             system is asymptotically stable if,
             and only if, for any symmetric
             positive-definite matrix C there exists a
             unique positive-definite matrix Q which
             satisfies the matrix equation

    $A^T Q + Q A = -C$    (3.3)

A full proof of this theorem appears in Hahn [H2]. Essential parts
also are shown by Bellman [B1]. The major results of interest are
delineated here for convenience.

1. The Lyapunov function is $V = x^T Q x$.
2. By direct computation,

    $\dot{V} = \dot{x}^T Q x + x^T Q \dot{x} = x^T A^T Q x + x^T Q A x = -x^T C x$    (3.4)

3. Given any positive definite matrix C, Q is uniquely
   determined by the matrix equation (3.3) if no two
   eigenvalues of A sum to zero and no eigenvalue is zero (see [B1]).

4. If A is stable then the unique Q is given by

    $Q = \int_0^\infty e^{A^T t}\,C\,e^{A t}\,dt$    (3.5)

Using 4 and a theorem³ stated in Browne [B2] it is easy to show
that if C is positive definite, then Q must be also. Since $C = B^T B$,
where B is a real non-singular matrix³, we can write

    $Q = \int_0^\infty e^{A^T t} B^T B\,e^{A t}\,dt = \int_0^\infty (B e^{A t})^T (B e^{A t})\,dt$    (3.6)

But B and $e^{At}$ are non-singular. Hence $B e^{At}$ is non-singular and Q is
positive definite by reapplication of Browne's theorem³.

Generally the use of Theorem 3.2 proceeds as follows:
(i) Choose any matrix C which is positive definite and solve the
n(n+1)/2 algebraic equations (3.3) for the elements of Q.
(ii) Test Q for definiteness. If it is positive definite the system
is asymptotically stable. If it is not positive definite the system
is not asymptotically stable. This follows from the necessary and
sufficient conditions stated in Theorem 3.2.
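The two-step procedure above can be sketched numerically. This is only an illustration under stated assumptions: rather than writing out the n(n+1)/2 independent equations, it solves the full set of n² linear equations obtained by stacking the columns of Q (a Kronecker-product formulation), which gives the same symmetric solution when it is unique. The function names and the example matrices are hypothetical.

```python
import numpy as np

def solve_lyapunov(A, C):
    """Solve A^T Q + Q A = -C for Q by stacking the columns of Q."""
    n = A.shape[0]
    I = np.eye(n)
    # vec(A^T Q) = (I kron A^T) vec(Q) and vec(Q A) = (A^T kron I) vec(Q)
    K = np.kron(I, A.T) + np.kron(A.T, I)
    q = np.linalg.solve(K, -C.flatten(order="F"))
    Q = q.reshape((n, n), order="F")
    return 0.5 * (Q + Q.T)          # remove round-off asymmetry

def is_positive_definite(M):
    """Definiteness test for a symmetric matrix via its eigenvalues."""
    return bool(np.all(np.linalg.eigvalsh(M) > 0))

# Step (i): choose C positive definite and solve for Q.
A = np.array([[0.0, 1.0], [-2.0, -2.0]])
C = np.eye(2)
Q = solve_lyapunov(A, C)

# Step (ii): test Q for definiteness.
asymptotically_stable = is_positive_definite(Q)
```

For this A the solution can be checked by hand: Q has diagonal entries 1.25 and 0.375 and off-diagonal entries 0.25, which is positive definite, so the origin is asymptotically stable. Repeating the steps with an unstable A yields a Q that fails the definiteness test.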


³ From Browne: If C is any real non-singular matrix and $A = C^T C$,
then A is positive definite. The converse is true: any positive
definite matrix A can be expressed as such a product. Similarly, if
C is n square of rank r < n then A is positive semi-definite of rank r.
The theorem corresponding to Theorem 3.2 for discrete time systems
is as follows [ref. K1,O1].

Theorem 3.2D The discrete time system described by

                 $x(n+1) = A_D\,x(n)$

             is asymptotically stable if, and only if,
             for any given positive definite matrix C
             there exists a unique positive definite
             matrix Q satisfying

    $A_D^T Q A_D - Q = -C$    (3.7)

             where $V = x^T Q x$ is a Lyapunov function
             with $\Delta V = -x^T C x$.

An algorithm for numerical solution of equation (3.7) for Q given
any positive definite matrix C is derived in Appendix B. This is a
new result and makes the application of Theorem 3.2D much easier than
in the past.
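For small systems, equation (3.7) can also be solved directly as a set of linear equations. The sketch below is not the Appendix B algorithm, which is not reproduced here; it is a generic Kronecker-product formulation offered only as an illustration, and the function name and example matrices are hypothetical.

```python
import numpy as np

def solve_discrete_lyapunov(A_D, C):
    """Solve A_D^T Q A_D - Q = -C for Q by stacking the columns of Q."""
    n = A_D.shape[0]
    # vec(A_D^T Q A_D) = (A_D^T kron A_D^T) vec(Q)
    K = np.kron(A_D.T, A_D.T) - np.eye(n * n)
    q = np.linalg.solve(K, -C.flatten(order="F"))
    Q = q.reshape((n, n), order="F")
    return 0.5 * (Q + Q.T)          # remove round-off asymmetry

# Hypothetical stable discrete system (eigenvalues 0.5 and 0.8).
A_D = np.array([[0.5, 0.1], [0.0, 0.8]])
C = np.eye(2)
Q = solve_discrete_lyapunov(A_D, C)

# Theorem 3.2D: Q positive definite confirms asymptotic stability.
q_is_positive_definite = bool(np.all(np.linalg.eigvalsh(Q) > 0))
```

The cost of this approach grows quickly with n because the linear system has n² unknowns, so it is intended for checking small examples rather than as a replacement for a purpose-built algorithm.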

At this point it is important to note that we cannot in general
choose any positive definite matrix for Q and then perform the
multiplication and addition indicated in (3.3) to obtain a C matrix which is
positive definite, even if A is known to be stable. The following
example illustrates this.

Example 1. For

    $A = \begin{bmatrix} 0 & 1 \\ -2 & -2 \end{bmatrix}$

and choosing Q = I, it follows that

    $C = -(A^T + A) = \begin{bmatrix} 0 & 1 \\ 1 & 4 \end{bmatrix}$

The system is stable, but C is indefinite. Therefore, $x^T x$ is not a
Lyapunov function for the system described by the given transition
matrix.
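Example 1 can be checked numerically. The sketch below assumes the matrices read A = [[0, 1], [-2, -2]] and Q = I; the test point `x` is a hypothetical choice made to exhibit a state at which $x^T x$ is momentarily increasing.

```python
import numpy as np

A = np.array([[0.0, 1.0], [-2.0, -2.0]])
Q = np.eye(2)
C = -(A.T @ Q + Q @ A)                  # C obtained from the chosen Q via (3.3)

eig_A = np.linalg.eigvals(A)            # -1 +/- j: the system is stable
eig_C = np.linalg.eigvalsh(C)           # one negative, one positive: C is indefinite

# At this state the derivative of V = x^T x, namely -x^T C x, is positive,
# so the squared norm grows for a while even though the system is stable.
x = np.array([1.0, -0.1])
v_dot = float(-x @ C @ x)
```

The eigenvalues of C are $2 \pm \sqrt{5}$, one of each sign, which confirms that C is indefinite although every eigenvalue of A has real part -1.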
The following theorem has not generally appeared in the literature
and is formulated here as a new result.


Theorem 3.3 If, for a stable system, its transition
            matrix A is real and commutes with its
            transpose (i.e., $A A^T = A^T A$) then

    $V = \|x\|^2$    (3.8)

            is a Lyapunov function for the system
            described by A.

This theorem gives a sufficient condition for the square of the
norm of the states to be a Lyapunov function. To show this we take
$\|x\| = (x^T x)^{1/2}$, hence $V = x^T x$, and show that $A^T + A$ must be
negative definite. But $A^T + A$ is a real symmetric matrix and must
have real negative eigenvalues [B2] if it is to be negative definite.
But this is to say that the linear system described by the transition
matrix $A^T + A$ is stable. Its fundamental matrix is then
$\phi = e^{(A^T + A)t}$. Now if $A^T A = A A^T$ then we can write $\phi = e^{A^T t}\,e^{A t}$.
By Theorems 2.1 and 2.2 of Section IIB, if $\|\phi\| < k$ then the system
is stable. But since A is stable, $\|e^{At}\| < k_1$ and $\|e^{A^T t}\| < k_2$; so we write

    $\|\phi\| \le \|e^{A^T t}\|\,\|e^{A t}\| < k_1 k_2 = k$
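The condition of Theorem 3.3 is simple to test numerically. The following sketch checks whether a matrix commutes with its transpose and whether $V = x^T x$ then has a negative definite derivative $x^T (A^T + A) x$; the function names and both example matrices are hypothetical illustrations.

```python
import numpy as np

def is_normal(A, tol=1e-12):
    """True when A commutes with its transpose."""
    return bool(np.allclose(A @ A.T, A.T @ A, atol=tol))

def norm_squared_is_lyapunov(A):
    """True when x^T (A^T + A) x is negative definite, so V = x^T x decreases."""
    return bool(np.linalg.eigvalsh(A.T + A).max() < 0)

A_normal = np.array([[-1.0, 1.0], [-1.0, -1.0]])   # stable and normal
A_other = np.array([[0.0, 1.0], [-2.0, -2.0]])     # stable but not normal

normal_case = (is_normal(A_normal), norm_squared_is_lyapunov(A_normal))
other_case = (is_normal(A_other), norm_squared_is_lyapunov(A_other))
```

The first matrix satisfies both tests, as the theorem predicts. The second is stable yet fails both, which shows that stability alone does not make the squared norm a Lyapunov function.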
For discrete systems the following theorem applies.

Theorem 3.3D If $A_D^T A_D = A_D A_D^T$ then a
             Lyapunov function for the discrete
             system $x(n+1) = A_D\,x(n)$ is
             $V = x^T x$ with $\Delta V = x^T (A_D^T A_D - I)\,x$.


Suppose there exists some transformation of states $y = Sx$
such that the transformed system matrix $D = S A S^{-1}$ is normal
(i.e., $D^T D = D D^T$). Then we can apply Theorem 3.3 and a Lyapunov
function for the transformed system is $V_y = y^T y$. Since stability
properties for linear systems are invariant under a similarity
transformation, we may apply the inverse transformation to $V_y$ to find a
suitable Lyapunov function in the x state space. The transformation
must also be applied to $\dot{V}$. This is summarized in the following theorem.


Theorem 3.4 If a non-singular transformation
            $y = Sx$ exists such that
            $D = S A S^{-1}$ is normal, then a
            Lyapunov function for the system
            $\dot{x} = Ax$ is

                $V = x^T S^T S\,x$ with
                $\dot{V} = x^T S^T (D^T + D)\,S\,x$

A similarity transformation may be applied to discrete systems as
in the development of Theorem 3.4 for continuous systems.


D. LINEAR FREE SYSTEM

Here we look briefly at a system of the form $\dot{x} = A(t)\,x$.

According to Theorem 3.1 of this chapter, if we can find a Lyapunov
function for the system it may be time varying. However, we may also
be able to find a Lyapunov function which is not explicitly a function
of time. Assuming that we can find a positive definite Q such that
$V = x^T Q x$ is a Lyapunov function, then

    $\dot{V} = x^T [A^T(t)\,Q + Q A(t)]\,x = -x^T C(t)\,x$

In order to apply Theorem 3.1, we must find a positive function
$\gamma(\|x\|)$ such that

    $\dot{V}(x,t) \le -\gamma(\|x\|)$ for all t.

Apparently we must somehow remove the time variation of A(t).
This obviously depends upon the nature of A(t). A general pursuit of
this problem is not very fruitful. However, for the special case
considered next we can get some useful results.


E. LINEAR SYSTEM WITH PERTURBED COEFFICIENTS

The special case of the linear free system considered here is that
the state transition matrix A(t) is of the form A(t) = A + G(t).

Now we assume $V(x) = x^T Q x$ and compute

    $\dot{V}(x,t) = -x^T C x + x^T (G^T(t)\,Q + Q\,G(t))\,x$    (3.9)

where $-C = A^T Q + Q A$.

From the results in Section IIIC, we know that if the system $\dot{x} = Ax$
is stable then for any choice of a positive definite matrix C we can
compute a positive definite matrix Q. Moreover, if G(t) tends to zero
as t increases, then there exists some $t_0$ such that for every $t > t_0$
the first term in the expression for $\dot{V}(x,t)$ will dominate and $\dot{V}$ is
negative definite. Hence, the assumed quadratic form for V(x) is
indeed a Lyapunov function. In this case we do not have uniform
stability since the result depends upon $t_0$.


Example 2. ] 
-| + + 0 
Wet A(t) = 1 
0 -2 + = 
-| 18 + 0 
Then A = G(t) = 1 
0 -2 0 ta 
Applying Theorem 3.3 we take Q = I; then

    $C = \begin{bmatrix} 2 & 0 \\ 0 & 4 \end{bmatrix}$

and C will dominate for all t > 1/2. Thus this system is ASIL for
any $t_0 > 1/2$.




For the discrete time system

    $x(n+1) = A\,x(n) + G(n)\,x(n)$

we again assume a Lyapunov function $V = x^T Q x$. Then by direct
computation

    $\Delta V = V(x(n+1)) - V(x(n))$
    $\quad = -x^T(n)\,C\,x(n) + x^T(n)\,[A^T Q\,G(n) + G^T(n)\,Q A + G^T(n)\,Q\,G(n)]\,x(n)$

where $-C = A^T Q A - Q$.

From Theorem 3.2D, we know that for any positive definite
C we can find a positive definite Q if A is stable. Similar comments
apply if G(n) tends to zero as for the continuous case.


F. FORCED LINEAR SYSTEMS

The usual application of Lyapunov's Second Method is to the study
of the dynamics of homogeneous linear systems or to the study of the
dynamics describing the motion of a system about some nominal trajectory
usually attributed to the forcing function. In this section we consider
a different approach which leads to a novel interpretation of the
Lyapunov function with inputs present.

Consider the linear system

    $\dot{x} = Ax + Bu(t)$    (3.10)

Again assume a Lyapunov function $V = x^T Q x$ and compute its time
derivative.

    $\dot{V} = \dot{x}^T Q x + x^T Q \dot{x}$

    $\quad = (u^T B^T + x^T A^T)\,Q x + x^T Q\,(Ax + Bu)$

    $\quad = x^T (A^T Q + Q A)\,x + u^T B^T Q x + x^T Q B u$

    $\dot{V} = -x^T C x + 2 x^T Q B u$    (3.11)


From Section IIIC we know that if the system $\dot{x} = Ax$ is stable we
can find a positive definite Q for any given positive definite C such
that $-C = A^T Q + Q A$. Also, note that the bilinear forms $u^T B^T Q x$ and
$x^T Q B u$ are transposes of each other. Since they are scalars, they must
be equal, and they may be combined as shown in the second term of
equation (3.11).

By Definition 2.7 in Section IIC the system described by
(3.10) is stable if and only if $\|x\| < k_1$. Thus if we can show that
$\dot{V}$ given by (3.11) is negative for $\|x\| > k_1$ we can be assured that
the Lyapunov function will converge to a region determined by $\|x\| < k_1$.
This result is stated in the following theorem, which has not been
stated explicitly in the literature heretofore.


Theorem 3.5 For the system $\dot{x} = Ax + Bu(t)$,
            if the homogeneous system is ASIL
            and has a Lyapunov function
            $V = x^T Q x$ with $\dot{V} = -x^T C x$ and
            $\|u(t)\| \le k_2$ for all t, then the
            system is stable and the state is
            sure to enter a region defined by

    $\|x\| \le k_1 = \frac{2 k_2\,\|Q\|\,\|B\|}{\lambda_1} - \epsilon$

            where $\lambda_1 = \lambda_{\min}(C)$, the minimum
            eigenvalue of matrix C, and $\epsilon < 0$
            with $|\epsilon|$ arbitrarily small.
Proof: The system is stable by Definition 2.7 of Section IIC. Take
as the system Lyapunov function $V = x^T Q x$; then

    $\dot{V} = -x^T C x + 2 x^T Q B u$

    $\quad \le -x^T C x + 2\,\|x\|\,\|Q\|\,\|B\|\,\|u\|$

Also, since⁴

    $\lambda_1 \|x\|^2 \le x^T C x$    (3.12)

certainly

    $\dot{V} \le -\lambda_1 \|x\|^2 + 2 k_2\,\|x\|\,\|Q\|\,\|B\|$

To complete the proof we must show that $\dot{V} < 0$ for all x such
that $\|x\| > k_1$. When $\|x\| > k_1$, then

    $\|x\| > \frac{2 k_2\,\|Q\|\,\|B\|}{\lambda_1}$

and

    $\lambda_1 \|x\|^2 > 2 k_2\,\|x\|\,\|Q\|\,\|B\|$

Thus

    $0 > -\lambda_1 \|x\|^2 + 2 k_2\,\|x\|\,\|Q\|\,\|B\| \ge \dot{V}$

It follows that $\dot{V}$ is negative when $\|x\| > k_1$, indicating that the
system response is leaving the region $\|x\| > k_1$ and moving toward
the region $\|x\| \le k_1$.

The results of this theorem only give an upper bound for the region 
to which the state will converge in the steady state. Unfortunately, 
this bound depends upon the Lyapunov function chosen. However, the 


bound is determined without finding the solution to the system equations. 





Example 3. Consider the stable system

    $\dot{x} = \begin{bmatrix} -1 & 1 \\ -1 & -1 \end{bmatrix} x + \begin{bmatrix} 1 \\ 0 \end{bmatrix} \sin \omega t$

so that

    $A = \begin{bmatrix} -1 & 1 \\ -1 & -1 \end{bmatrix}, \qquad B = \begin{bmatrix} 1 \\ 0 \end{bmatrix}$

and compute a steady state bound for the state of the system.

Theorem 3.3 applies to the homogeneous dynamics; therefore a
suitable Lyapunov function is

    $V = x^T x, \qquad \dot{V} = x^T (A^T + A)\,x = -x^T (2I)\,x$

that is, Q = I and C = 2I, and the homogeneous system is ASIL. The
results of Theorem 3.5 are used to compute an upper bound for $\|x\|$ in
the steady state. The parameters for equation (3.12) are as follows,
taking for the matrix $M = (m_{ij})$ the norm $\|M\| = [\mathrm{tr}(M M^T)]^{1/2}$
(see Appendix A):

    $\|u(t)\| = |\sin \omega t| \le k_2 = 1$

    $\|Q\| = \sqrt{2}, \qquad \|B\| = 1$

    $\lambda_1 = \lambda_{\min}(C) = 2$

    $\epsilon < 0$ and $|\epsilon|$ is arbitrarily small

It follows that

    $\|x\| \le k_1 = \frac{2 k_2\,\|Q\|\,\|B\|}{\lambda_1} - \epsilon = \sqrt{2} - \epsilon$


⁴ Relation (3.12) is derived using the Rayleigh quotient $\frac{x^T C x}{x^T x}$.
See Bellman [B3] p. 110.
It should be noted here that if $C = \lambda_1 I$ the computed bound $k_1$
is independent of $\lambda_1$. This follows because $-C = A^T Q + Q A$. If for
C = I the resulting Q is $Q_1$, then for $C = \lambda_1 I$ the resulting Q is
$\lambda_1 Q_1$. Thus $\lambda_1$ cancels out in the expression for $k_1$.
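The bound of Example 3 can be compared with a simulated response. The sketch below is an illustration under stated assumptions: it takes A = [[-1, 1], [-1, -1]], B = [1, 0], Q = I and the Frobenius matrix norm, picks the hypothetical values $\omega = 1$ and an arbitrary initial state, and integrates with a fixed-step fourth-order Runge-Kutta scheme that is not part of this thesis.

```python
import numpy as np

A = np.array([[-1.0, 1.0], [-1.0, -1.0]])
B = np.array([1.0, 0.0])
omega = 1.0                               # hypothetical forcing frequency

def f(t, x):
    """Right-hand side of the forced system x' = A x + B sin(omega t)."""
    return A @ x + B * np.sin(omega * t)

def rk4_step(t, x, dt):
    """One classical fourth-order Runge-Kutta step."""
    k1 = f(t, x)
    k2 = f(t + dt / 2, x + dt / 2 * k1)
    k3 = f(t + dt / 2, x + dt / 2 * k2)
    k4 = f(t + dt, x + dt * k3)
    return x + dt / 6 * (k1 + 2 * k2 + 2 * k3 + k4)

# Bound from Theorem 3.5 with Q = I, C = -(A^T Q + Q A), and |sin| <= k2 = 1.
Q = np.eye(2)
C = -(A.T @ Q + Q @ A)
k2 = 1.0
lam1 = np.linalg.eigvalsh(C).min()
k1 = 2 * k2 * np.linalg.norm(Q) * np.linalg.norm(B) / lam1   # Frobenius norm of Q

# Simulate from an arbitrary initial state and record the late-time norm.
dt, x = 0.01, np.array([3.0, -3.0])
late_norms = []
for i in range(2000):
    t = i * dt
    x = rk4_step(t, x, dt)
    if t >= 15.0:                         # transient has decayed by this time
        late_norms.append(np.linalg.norm(x))
steady_max = max(late_norms)
```

In this run the steady-state norm stays well inside the computed value of $k_1$, which is consistent with the remark that the theorem gives only an upper bound for the region of convergence.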
The same theorem holds for discrete systems with appropriate
changes in notation.

It is important to note that, for linear systems, the forcing
functions have no effect upon the asymptotic stability properties of
the system. They do, however, influence the behavior of the system
in the steady state.


G. TRANSIENT ESTIMATION 

Since the level curves of V give, in some sense, a measure of 
distance from the origin, V may be used to estimate the convergence 
rate of asymptotically stable systems. The following is taken from 
Kalman and Bertram [K1] with some minor additions. 


Consider the obvious inequality:

    \dot{V}(x,t) = \frac{\dot{V}(x,t)}{V(x,t)} V(x,t) \le -\eta_1 V(x,t)    (3.13)

where \eta_1 is the minimum of the ratio -\dot{V}/V in some region
(radius r) of state space excluding the origin. In cases where the
basic stability theorem applies (Theorem 3.1 page 21) we may take


    \eta_1 = \inf \{ -\dot{V}(x,t)/V(x,t) ;\ 0 < ||x|| \le r \}    (3.14)


Integrating equation (3.13), observe that

    \int_{t_0}^{t} \frac{\dot{V}}{V} dt \le -\eta_1 \int_{t_0}^{t} dt

so that

    V(x(t),t) \le V(x_0,t_0) e^{-\eta_1 (t - t_0)}    (3.15)

Footnote: \inf f(y) denotes the greatest lower bound of f(y) over the indicated range of y.


The number 1/\eta_1 is an upper bound for the time constant
describing the convergence of the Lyapunov function following the
system trajectory. If the system is linear the Lyapunov function is
a quadratic form and 2/\eta_1 is the corresponding bound for the
convergence of the system trajectory.
Similarly we can write \dot{V} \ge -\eta_2 V, where \eta_2 is a maximum of the
ratio -\dot{V}/V, giving a lower bound for the system time constant.
Kalman [K1] has carried out the minimization to obtain \eta_1
for the linear time invariant case. These results are stated here.

If the system \dot{x} = Ax is ASIL and has a Lyapunov function V = x'Qx
with \dot{V} = -x'Cx, we take

    \eta_1 = \min_x \{ x'Cx ;\ V(x) = 1 \}    (3.16)

The minimization leads to

    \eta_1 = \lambda_{\min}(CQ^{-1})    (3.17)

Similarly

    \eta_2 = \lambda_{\max}(CQ^{-1})    (3.18)
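
As a minimal sketch of (3.17) and (3.18), the following computes Q from -C = A'Q + QA and then the two bounds from the eigenvalues of CQ^{-1}. The Kronecker-product solver and the function names are choices made for this illustration, not part of the source; the test system is an arbitrary stable diagonal matrix.

```python
import numpy as np

def solve_lyapunov(A, C):
    """Solve A'Q + QA = -C for Q by vectorising with Kronecker products."""
    n = A.shape[0]
    I = np.eye(n)
    K = np.kron(A.T, I) + np.kron(I, A.T)
    return np.linalg.solve(K, -C.reshape(-1)).reshape(n, n)

def convergence_bounds(A, C):
    """Return Q, eta_1 = lambda_min(C Q^-1) and eta_2 = lambda_max(C Q^-1)."""
    Q = solve_lyapunov(A, C)
    ratios = np.linalg.eigvals(C @ np.linalg.inv(Q)).real
    return Q, ratios.min(), ratios.max()

# Illustrative system: A = diag(-1, -2) with C = I.
A = np.diag([-1.0, -2.0])
Q, eta1, eta2 = convergence_bounds(A, np.eye(2))
print(Q)            # diag(0.5, 0.25)
print(eta1, eta2)   # 2.0 and 4.0
```

For this diagonal case the bounds equal twice the magnitudes of the eigenvalues of A, so V decays no slower than e^{-2t} and no faster than e^{-4t}.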


Figure 1 illustrates the relationship of V(x(t)) and those functions 
formed by the n bounds. 

These convergence estimates depend upon the Lyapunov function 
chosen. However, there are certain cases where the estimates are 
identical to those obtained directly from the system matrix A. The 
following new result is stated as a theorem the proof of which uses 
a result indicated in Bellman [B1] involving the eigenvalues of 


rational functions of several matrices, as follows: 
FIGURE 1

Lyapunov function and convergence bounds
following the system trajectory x(t)

FIGURE 2

Effect of system parameters G on \dot{V}.
G_0 minimizes \dot{V}
        Let A and B be commutative matrices and
        f(x,y) be a rational function. Then the
        characteristic roots of f(A,B) are
        f(a_i,b_i), where a_i and b_i are the
        characteristic roots of A and B in an
        ordered sequence which depends upon the
        matrices A and B.


Theorem 3.6  If Theorem 3.3 of Section III C, page 25,
             applies and Q is chosen as the identity
             matrix, then -\eta_1 = 2\lambda_{\max}(A) and the
             convergence estimate using Lyapunov
             functions is identical with that of the
             actual system.


The proof is immediate since if Q = I then C = -(A + A').
Taking B = A' in the above result and noting that \lambda(A') = \lambda(A),
we have that \lambda(A + A') = 2\lambda(A). Then
\lambda_{\min}(C) = 2\lambda_{\min}(-A) = -2\lambda_{\max}(A).
The utility of this method is obviously not in estimating the
convergence of linear time invariant systems, since one eigenvalue
problem is replaced by another. Its utility in this thesis is in application
to time varying systems as illustrated in the following example.


Example 4.

    A(t) = \begin{bmatrix} -(1 + \frac{1}{t}) & 0 \\ 0 & -(2 + \frac{1}{t}) \end{bmatrix}

It is desired to estimate the convergence
rate for the system described by the given A(t).
Write this as

    A(t) = A - G(t) = \begin{bmatrix} -1 & 0 \\ 0 & -2 \end{bmatrix} - \begin{bmatrix} \frac{1}{t} & 0 \\ 0 & \frac{1}{t} \end{bmatrix}

By applying the lemmas developed in Section IV C which follows, it can
be shown that a suitable Lyapunov function is (see Example 6, Section IV C)

    V = x'x,    \dot{V} = x'(A + A' - G(t) - G'(t))x    (3.19)
                        = x'(A + A')x - x'(G(t) + G'(t))x
                        \le -\gamma = -x'Cx,    where -C = A + A'.
For this example

    Q = I,    C = \begin{bmatrix} 2 & 0 \\ 0 & 4 \end{bmatrix}

    \eta_1 = \min_x \frac{\gamma}{V} = \min_x \frac{x'Cx}{x'Qx} = \lambda_{\min}[CQ^{-1}] = 2
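
The estimate \eta_1 = 2 of Example 4 can be compared with the actual trajectory. The sketch below uses the closed-form solution of \dot{x} = A(t)x for the diagonal A(t) of the example (derived here for illustration, with an arbitrary initial state and t_0 = 1 as assumptions) and checks the bound (3.15) on a time grid.

```python
import numpy as np

def state(t, t0, x0):
    """Exact solution of x' = A(t)x with A(t) = -diag(1 + 1/t, 2 + 1/t):
    each component decays as (t0/t) * exp(-rate * (t - t0))."""
    rates = np.array([1.0, 2.0])
    return x0 * (t0 / t) * np.exp(-rates * (t - t0))

t0, x0, eta1 = 1.0, np.array([1.0, -2.0]), 2.0   # illustrative initial data
times = np.linspace(t0, 6.0, 200)

# V = x'x evaluated along the trajectory.
V = np.array([state(t, t0, x0) @ state(t, t0, x0) for t in times])
# Bound from (3.15): V(t) <= V(t0) * exp(-eta1 * (t - t0)).
bound = V[0] * np.exp(-eta1 * (times - t0))
bound_holds = bool(np.all(V <= bound + 1e-12))
print(bound_holds)
```

The time varying term G(t) only speeds up the decay, so the bound computed from the constant matrix C is conservative but valid over the whole interval.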


Suppose that \dot{V} is a function of a matrix of parameters G, as for
example, the G(t) in equation (3.19). Now if for some G = G_0,
\dot{V} is a minimum with respect to G, then the ratio \dot{V}/V is also a
minimum; hence, \eta_1 is maximum (see eq. (3.13)). It follows that if
\eta_1 is maximum for G = G_0 then the system involving G_0 converges
faster than systems involving other G's. Figure (2) illustrates this
concept of a minimum ratio (\dot{V}/V), which also appears in Section V
in connection with the derivation of the optimal filter gain. In this
case it is possible to choose G(t) so as to maximize the system
convergence rate.

Corresponding results to the foregoing are established by Kalman
in [K2] for discrete systems. The main difference is that a recursive
relation for V(x(n)) is established as

    V(x(n+1)) \le e^{-\eta T} V(x(n))

where T is the sample period and \eta is the corresponding constant in
the continuous case. Again the minimization leads to

    e^{-\eta_1 T} = 1 - \lambda_{\min}(CQ^{-1})
H. CONCLUSION

General results are difficult to obtain using the foregoing methods
for time varying linear systems. Often the results give sufficient
conditions only, and upper bounds which depend upon the chosen Lyapunov
function. However, there is an advantage in that these results are
always obtained easily without solving the system equation.

The approach of restricting the Lyapunov function to be explicitly
time invariant was considered and serves as an introduction for a unified
approach to the special system problem considered in the next chapter.


IV. THE HOMOGENEOUS FILTER OBSERVER EQUATION 


A. INTRODUCTION 

The remainder of this thesis is concerned with the study of
the basic recursive averaging filter-observer using the foregoing
concepts of stability theory.

The filter problem is usually formulated by considering a signal 
model driven by white noise and whose output measurements contain 
additive white noise. Such a model is described by 

    \dot{x} = Ax + w(t)
    z = Hx + v(t)    (4.1)

where A is an n×n state transition matrix, H is an m×n measurement
matrix, w(t) and v(t) are n and m vectors respectively of uncorrelated
white noise.


Filtering is accomplished by recursively operating on the
observations z to estimate the values of x. The estimates are
denoted \hat{x}. The filter dynamics are given by

    \dot{\hat{x}} = A\hat{x} + G(t)(z - H\hat{x})    (4.2)

where G(t) is the n×m filter gain matrix the choice of which determines
the filter characteristics. If the error, e = x - \hat{x}, is formed and
the expected value of the error squared (P = E[ee']) is minimized
to determine G(t), then (4.2) becomes the optimal integral squared
error filter derived by Kalman and Bucy [K3]. If G(t) is determined
by some other method, the filter is suboptimal (for example see [S2]).


The optimal gain is given by

    G(t) = P(t)H'R^{-1}    (4.3)

where P(t) satisfies the matrix Riccati equation

    \dot{P} = AP(t) + P(t)A' - P(t)H'R^{-1}HP(t) + W    (4.4)

and R and W are the covariance matrices of the measurement noise v(t)
and the process noise w(t), respectively.
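
To make (4.3) and (4.4) concrete, here is a minimal sketch that integrates the Riccati equation with a forward-Euler step and forms the gain. The step size, horizon, initial covariance and the scalar test system are assumptions chosen for illustration; a production implementation would use a proper ODE solver.

```python
import numpy as np

def riccati_rhs(P, A, H, R_inv, W):
    """Right-hand side of (4.4): AP + PA' - PH'R^{-1}HP + W."""
    return A @ P + P @ A.T - P @ H.T @ R_inv @ H @ P + W

def integrate_gain(A, H, R, W, P0, dt=1e-3, steps=20000):
    """Forward-Euler integration of (4.4).
    Returns the final P and the gain G = PH'R^{-1} of (4.3)."""
    R_inv = np.linalg.inv(R)
    P = P0.copy()
    for _ in range(steps):
        P = P + dt * riccati_rhs(P, A, H, R_inv, W)
    return P, P @ H.T @ R_inv

# Scalar illustration: A = -1, H = 1, R = 1, W = 1.
A = np.array([[-1.0]])
H = np.array([[1.0]])
R = np.array([[1.0]])
W = np.array([[1.0]])
P, G = integrate_gain(A, H, R, W, P0=np.array([[1.0]]))
print(P, G)
```

For this scalar case the steady state satisfies 0 = -2P - P^2 + 1, so P should settle near \sqrt{2} - 1, and the gain equals P because H = R = 1.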


Suppose we rewrite equation (4.2) so that it is in the form of 
equation (4.1). We obtain 


    \dot{\hat{x}} = (A - G(t)H)\hat{x} + G(t)z    (4.5)

This equation is interpreted as a system having a state transition
matrix, A-G(t)H, with the input vector z. Thus, we are indeed
taking the observations z and filtering them to provide the output
state vector \hat{x}.

Let us also find the equation for the error dynamics. Define
the error, e = x - \hat{x}, to be the difference between the actual states
and the estimated values out of the filter.

    \dot{e} = \dot{x} - \dot{\hat{x}} = Ax + w(t) - A\hat{x} - G(t)(z - H\hat{x})    (4.6)

Noting from (4.1) that z = Hx + v(t) we write

    \dot{e} = Ae - G(t)[Hx + v(t) - H\hat{x}] + w(t)    (4.7)

and

    \dot{e} = (A - G(t)H)e - G(t)v(t) + w(t)    (4.8)

Observe that the homogeneous dynamics for the filter equation 
(4.5) and the error equation (4.8) are identical. Both are described 
by the state transition matrix A - G(t)H. Consequently the results 
of this section apply to both the output of the filter and to the
actual error.

Now suppose that G(t) is a matrix of constants, G_c. Then the
filter transition matrix becomes A - G_c H and the problem reduces
to an ordinary linear time invariant case. Moreover, if G_c = I
the filter-observer equation reduces to that form derived by
Luenberger [L1] for observation of states.


In discrete time problems the signal model and optimal filter
equations become, respectively:

    x(n+1) = \Phi(T)x(n) + w(n)
    z(n) = Hx(n) + v(n)    (4.9)

    \hat{x}(n) = \Phi(T)\hat{x}(n-1) + G(n)[z(n) - H\Phi(T)\hat{x}(n-1)]    (4.10)

    G(n) = P(n/n-1)H'[HP(n/n-1)H' + R]^{-1}
    P(n/n) = P(n/n-1) - G(n)HP(n/n-1)
    P(n+1/n) = \Phi P(n/n)\Phi' + W    (4.11)

where the notation P(n+1/n) is the estimated covariance of error
at time n+1 given the previous n corrected estimates.
For discrete systems, we find equations

    \hat{x}(n+1) = (\Phi(T) - G(n)H\Phi(T))\hat{x}(n) + G(n)z(n)    (4.12)
    e(n+1) = (\Phi(T) - G(n)H\Phi(T))e(n) - G(n)v(n) + w(n)    (4.13)


Again the unforced dynamics for the filter states and the estimation 


error are the same. 
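
The discrete recursion can be sketched as a single function. This is an illustrative reading of the discrete optimal filter equations (state update, gain, corrected covariance, extrapolated covariance); the function name, argument names and the scalar test values are assumptions.

```python
import numpy as np

def filter_step(x_prev, P_pred, z, Phi, H, R, W):
    """One recursion of the discrete filter.

    x_prev : corrected estimate at step n-1
    P_pred : P(n/n-1), the extrapolated error covariance
    Returns the corrected estimate at n, the gain G(n), P(n/n) and P(n+1/n).
    """
    G = P_pred @ H.T @ np.linalg.inv(H @ P_pred @ H.T + R)
    x_ext = Phi @ x_prev                      # extrapolated estimate
    x_new = x_ext + G @ (z - H @ x_ext)       # correction by the residual
    P_corr = P_pred - G @ H @ P_pred          # P(n/n)
    P_next = Phi @ P_corr @ Phi.T + W         # P(n+1/n)
    return x_new, G, P_corr, P_next

# Scalar illustration: Phi = H = R = 1, W = 0, P(n/n-1) = 1, z = 2.
one = np.array([[1.0]])
x_new, G, P_corr, P_next = filter_step(
    np.array([0.0]), one, np.array([2.0]), one, one, one, np.array([[0.0]])
)
closed_loop = one - G @ one @ one             # Phi - G H Phi
print(x_new, G, P_corr, P_next, closed_loop)
```

The matrix `closed_loop` is the unforced transition matrix shared by the filter state and the error, which is the quantity whose stability is studied in the sections that follow.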


Bona [B4] has considered the problem of choosing the eigenvalues
of the matrix (\Phi(T) - G(n)H\Phi(T)) to give a desired convergence
property (namely, exact estimate of the observation in a finite
number of sample periods for noiseless systems) leading to the
specification of G(n). He has found that the observer can be made
to converge as rapidly as desired. However, the observer is highly
susceptible to error due to the presence of noise.


Footnote: Luenberger [L1] has shown by using linear transformation
theory that unobserved output states of a free linear
system may be observed by passing the observed output
states through another linear system whose transition
matrix is A-H, provided A-H is stable.


The filter gain matrix G(t) or G(n) has a twofold purpose in
the filter observer equation.
    1. To provide observable outputs of unobserved states.
    2. To provide some programmed weighting of the actual
       observations in an attempt to compensate for
       errors introduced by noise.
Further properties of G are studied in the following sections.


B. FILTER STABILITY

Stability of the optimal filter has been demonstrated in general 
by Kalman. A more general statement of the following theorem 
appears in Kalman and Bucy [K3]. 


Theorem 1 (Kalman) [K3]. For a linear time invariant
        signal model also having stationary
        statistics, if the model is:
        (i.)  completely observable
        (ii.) completely controllable
        then the optimal filter is asymptotically
        stable.


It is evident that even if the signal model equation (4.1) of
section IV A is unstable, the filter may still be asymptotically
stable. That is, the matrix (A-G(t)H) in equation (4.5) has
its own stability properties. Thus, it is not necessary that the
signal model be stable for the filter to be stable.

Deyst and Price [D1] have derived conditions for asymptotic
stability of the discrete optimal filters by following the previous
work of Kalman for the continuous optimal filter. Their method
uses a time varying Lyapunov function, namely choosing Q = P^{-1}(n),
where P(n) is the discrete covariance of errors. Then, by applying
the conditions for controllability and observability, they found
the desired bounding functions needed to apply the basic Lyapunov
stability theorem. The techniques developed in this paper, namely
requiring the Lyapunov function to be time invariant, generally
apply to a more limited class of signal models (i.e., asymptotically
stable ones). However, the method is also much simpler and applies
to suboptimal filters as well. Such a study leads to insight into
what the filter gain must be to insure a stable filter and may in
some cases be used to actually choose the G matrix requiring that
the filter be stable.

Generally, the technique proposed here may be outlined as 
follows: 

Assume that there is a Lyapunov function of the form V = e'Qe;
then compute

    \dot{V} = e'(A'Q + QA)e - e'(H'G'(t)Q + QG(t)H)e    (4.14)

In some cases equation (4.14) may be used directly to determine
the matrix Q. However, in other cases, Q may be determined from
-C = A'Q + QA by applying Theorem 3.2 page 22. If A is asymptotically
stable then \dot{V} of (4.14) becomes

    \dot{V} = -e'Ce - e'(H'G'(t)Q + QG(t)H)e    (4.15)

If equation (4.14) is negative definite for t \ge t_0 then the
filter is ASIL by theorem 3.1 on page 22. Negative definiteness of
\dot{V} may be shown for all t \ge t_0. Alternately it is possible to find a
bounding function, \gamma(||x||), as required by theorem 3.1. Obviously,
if the second quadratic form in equation (4.15) is at least positive
semi-definite for all t \ge t_0, then \dot{V} \le -e'Ce and the conditions
of theorem 3.1 are met.
The fact that the filter is ASIL means that, considering the
filter as a linear system free of input forcing functions, the
error starting at any arbitrary initial value will tend to zero
as t increases.

This tells us nothing about the actual estimates or error.
These questions must be considered with the input present which,
due to its stochastic nature, limits the asymptotic value of the
effective error. Such consideration appears in Chapter V.

For discrete systems, taking the same Lyapunov function as 


for the continuous case and using Theorem (3.2D) page 24, we obtain: 


    \Delta V(e,n) = e'[\Phi'(I - G(n)H)'Q(I - G(n)H)\Phi - Q]e    (4.16)
    \Delta V(e,n) = e'(\Phi'Q\Phi - Q)e - e'(\Phi'QG(n)H\Phi + \Phi'H'G'(n)Q\Phi)e
                    + e'\Phi'H'G'(n)QG(n)H\Phi e    (4.17)
and
    \Delta V = -e'Ce - 2e'(\Phi'QG(n)H\Phi)e + e'\Phi'H'G'(n)QG(n)H\Phi e    (4.18)
where -C = \Phi'Q\Phi - Q.


The following serve to illustrate the kinds of filter problems 
considered in this thesis. 
1. Filter Gain Constant

Case 1   We are given G(t) = G_c, where G_c is a matrix
         of known constants. For this case we may
         take V = e'Qe and rewrite equation (4.14)
         as
             \dot{V} = e'((A - G_c H)'Q + Q(A - G_c H))e    (4.18a)
         Since A, H and G_c are all known, we may
         apply Theorem (3.2) page 22 directly to
         determine Q. An algorithm for numerical
         generation of Q is derived in appendix B
         for the discrete case (Theorem 3.2D).
Case 2   Choice of G_c so that the filter is stable.
         Now use equation (4.15) for \dot{V} and determine
         Q based upon the signal model by applying
         Theorem 3.2 page 22. This requires that the
         signal model be asymptotically stable if
         a suitable Q is to be found.

The second quadratic form in equation (4.15) is now:

    e'(H'G_c'Q + QG_c H)e = e'De
    D = H'G_c'Q + QG_c H    (4.19)

If G_c can be chosen so that the matrix D in (4.19) is
at least positive semi-definite (p.s.d.), the filter is sure
to be stable since the first quadratic form in equation
(4.15) is already negative definite.


Example 5

Consider again the system described in
Example 3, page 30, where

    A = \begin{bmatrix} -1 & 1 \\ -1 & -1 \end{bmatrix},    B = \begin{bmatrix} 1 \\ 0 \end{bmatrix}

    Q = I,    C = \begin{bmatrix} 2 & 0 \\ 0 & 2 \end{bmatrix}

Let H = [1  0].

In case 2, it is desired to choose G_c so
that the matrix, D, in (4.19) is at least
positive semi-definite, which is a sufficient
condition to insure that the filter
be stable. In order that H and G_c be
conformable, G_c must be of the form

    G_c = \begin{bmatrix} g_1 \\ g_2 \end{bmatrix}

then

    G_c H = \begin{bmatrix} g_1 & 0 \\ g_2 & 0 \end{bmatrix}

and from (4.19)

    D = \begin{bmatrix} 2g_1 & g_2 \\ g_2 & 0 \end{bmatrix}

The condition that D be p.s.d. is that its
principal minors be non-negative [ref. B2].
That is

    2g_1 \ge 0,    -g_2^2 \ge 0

Obviously g_2 = 0 and g_1 \ge 0 satisfy the
conditions. Therefore, any filter with

    G_c = \begin{bmatrix} g_1 \\ 0 \end{bmatrix},    g_1 \ge 0

will be ASIL.
Now suppose we are given

    G_c = \begin{bmatrix} 1 \\ 0 \end{bmatrix}

Find the convergence bound \eta_1. From (4.18a)

    \dot{V} = e'(M'Q + QM)e

where

    M = A - G_c H = \begin{bmatrix} -2 & 1 \\ -1 & -1 \end{bmatrix}

We may compute the Lyapunov function Q from
theorem 3.2 for any p.d. matrix C such that

    -C = M'Q + QM

But in this case M + M' is negative definite, and we may
take Q = I as in theorem 3.3, page 25. Thus

    Q = I,    C = -(M + M') = \begin{bmatrix} 4 & 0 \\ 0 & 2 \end{bmatrix}

The convergence bound, \eta_1, is given by (3.17)

    \eta_1 = \lambda_{\min}(CQ^{-1}) = \lambda_{\min}(C) = 2
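
The two steps of Example 5 can be reproduced numerically. The sketch below forms D from (4.19) for a candidate gain, checks its eigenvalues, and evaluates \eta_1 from (3.17) with Q = I; the helper names and the second, deliberately unsuitable, gain are illustrative assumptions.

```python
import numpy as np

A = np.array([[-1.0, 1.0], [-1.0, -1.0]])
H = np.array([[1.0, 0.0]])
Q = np.eye(2)

def D_matrix(G):
    """D = H'G'Q + QGH from (4.19)."""
    return H.T @ G.T @ Q + Q @ G @ H

def eta1_for_gain(G):
    """eta_1 = lambda_min(C Q^{-1}) with -C = M'Q + QM and M = A - GH."""
    M = A - G @ H
    C = -(M.T @ Q + Q @ M)
    return np.linalg.eigvals(C @ np.linalg.inv(Q)).real.min()

G = np.array([[1.0], [0.0]])                     # g1 = 1, g2 = 0
d_eigs = np.linalg.eigvalsh(D_matrix(G))         # D = [[2, 0], [0, 0]]
eta1 = eta1_for_gain(G)                          # lambda_min of [[4, 0], [0, 2]]

# A gain with g2 != 0 makes D indefinite, so the sufficient test fails.
bad = np.linalg.eigvalsh(D_matrix(np.array([[1.0], [0.5]])))
print(d_eigs, eta1, bad)
```

Failing the test for g_2 ≠ 0 does not prove such a filter unstable; the positive semi-definiteness of D is only a sufficient condition tied to the particular choice Q = I.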


2. Special Time Varying Gain 


Let the filter gain matrix be time varying such that

    \lim_{t \to \infty} G(t) = 0    (4.20)

the null matrix. Here we have a problem very similar to that
discussed in section III E, and those comments apply here. Also,
if the matrix

    (H'G'(t)Q + QG(t)H)    (4.21)

in equation (4.15) is at least positive semi-definite for all
t \ge t_0, then we may not need the requirement of (4.20). Sufficient
conditions for the matrix (4.21) to be positive semi-definite are
developed in section IV C which follows.

3. Optimal Filter

Substituting the optimal gain G(t) = P(t)H'R^{-1} into
equation (4.15) we obtain

    \dot{V} = e'(A'Q + QA)e - e'(H'R^{-1}HP(t)Q + QP(t)H'R^{-1}H)e    (4.22)

If the signal model is asymptotically stable, Q may be determined
for any p.d. matrix C as before and (4.22) becomes

    \dot{V} = -e'(C + H'R^{-1}HP(t)Q + QP(t)H'R^{-1}H)e    (4.23)

Now the condition for stability from (4.23) is that

    F(y,t) = y'(C + H'R^{-1}HP(t)Q + QP(t)H'R^{-1}H)y > 0    (4.24)

for all t. Moreover, if

    \min_t F(y,t) > 0

then the condition will hold for all t. From ordinary calculus
a minimum occurs for t = t_1 if

    \frac{dF(y,t_1)}{dt} = 0    and    \frac{d^2 F(y,t_1)}{dt^2} > 0

Taking the derivative of equation (4.24) yields

    \dot{F}(y,t) = y'(H'R^{-1}H\dot{P}(t)Q + Q\dot{P}(t)H'R^{-1}H)y = 0    (4.25)

One way for (4.25) to be zero is for

    \dot{P}(t) = 0    (4.26)


Kalman [K3] has shown that condition (4.26) yields an asymptotic
or steady state value for P(t) in the limit as t approaches infinity
by solving the Riccati equation (4.4) page 39 for P with \dot{P} = 0.
However, for this condition, \lim \ddot{P}(t) is also zero; hence, the
foregoing does not in general lead to the desired minimum. Only
in the trivial case where P(t) is diagonal does this method lead
to the desired stability condition, since the diagonal elements of
P(t) are monotonic decreasing to the limit.

The elements of the matrix in (4.25) are, in general, linear
combinations of all the elements of \dot{P}(t). There may be some
t = t_1 which will make (4.25) zero other than in the limit as
t approaches infinity. However, t_1 is generally difficult to
find, even for a specific example. Often it is necessary to compute
the trajectory for all elements of P(t); then, using this, study
the properties of equation (4.24) at various values of time in an
effort to determine t_1. Besides being difficult, the method ceases
to be useful, since, once P(t) is known for all t, complete knowledge
of the filter is at hand.

Further research is required in the area of finding a
suitable bounding function, \gamma(||e||), for equations (4.22) or (4.23).

For suboptimal filters one might choose G(t) in equation 
(4.15) in order to establish the stability condition. The next 
section attempts to build some insight into the nature of G(t) 


required to insure a stable filter. 


C. STABILITY CONSTRAINTS ON G(t) 
In the last section we have found for the filter error dynamics
that if V = e'Qe is a Lyapunov function then

    \dot{V} = -e'Ce - e'(H'G'(t)Q + QG(t)H)e    (4.27)
            = -e'Ce - 2e'(QG(t)H)e    (4.27a)

where -C = A'Q + QA and Q may be determined if the signal
model \dot{x} = Ax is ASIL. Equation (4.27a) is written as shown
for convenience even though the second term is not a true quadratic
form (i.e., the weighting matrix is not necessarily symmetric).
In this section some sufficient conditions are found which insure
that \dot{V} in (4.27) be negative for all t. If these sufficient conditions
hold for a given G(t) then the filter is shown to be ASIL. On the
other hand, one may choose G(t) in order to make the conditions apply,
thus designing a filter which is known to be ASIL.
In applying the basic stability theorem of section III B, we
must find a positive function \gamma such that

    \dot{V}(e,t) \le -\gamma(||e||)    for all t
Equivalently, we may use a quadratic form for the bounding
function of \dot{V}, that is, we must find a positive definite matrix,
D, such that:

    \dot{V}(e,t) \le -e'De    (4.28)

Applying this to equation (4.27) we require:

    e'[C + H'G'(t)Q + QG(t)H]e \ge e'De    (4.29)

At this point we introduce inequality notation for symmetric
matrices. Namely, A \ge B means that (A-B) is positive semi-definite.
If equality is removed, then (A-B) is positive definite. Writing
relation (4.29) in this notation:

    C + H'G'(t)Q + QG(t)H \ge D    for all t    (4.30)

The matrix D is important if one is to use the technique
described in section III G to estimate transient response of the filter.
The form \gamma = e'De replaces -\dot{V} in the ratio -\dot{V}/V used to determine
\eta_1, hence estimating the convergence rate of the filter. Large D
means that the gradient of \gamma is steep; hence, the convergence bound
is better, since the actual filter converges faster than the estimate
given by \eta_1. Thus, if G(t) is known, then it is desired to choose
the largest D which will satisfy (4.30). On the other hand, if we
wish to design the filter to converge faster than some given bound,
then D is given and (4.30) provides a constraining relation for
G(t).

If we rewrite (4.30) and apply the properties of the norm to both
sides, we get another bounding relation for G(t).

    H'G'(t)Q + QG(t)H \ge D - C    (4.31)
    ||H'G'(t)Q|| + ||QG(t)H|| \ge ||D - C||
    2 ||Q|| ||G(t)|| ||H|| \ge ||D - C||
    ||G(t)|| \ge \frac{||D - C||}{2 ||Q|| ||H||}    for all t    (4.32)

Intuitively, equality in (4.31) and (4.32) should result in a
minimum condition of the left hand sides for all t. That is, in
(4.32)

    \min_t ||G(t)|| = \frac{||D - C||}{2 ||H|| ||Q||}    (4.32a)

A similar interpretation of the equality statement for (4.31)
is not so easily conceived.

The basic requirement for stability given by (4.30) or (4.31)
is that

    \min_t \lambda_{\min}(H'G'(t)Q + QG(t)H - D + C) = 0    (4.33)

Application of (4.33) is very difficult. However, sufficient conditions
to insure that (4.33) does hold may be found by applying an important
theorem involving bounds on the eigenvalues of a matrix. The
Hadamard-Gerschgorin theorem is stated here as found in Bodewig [B3].


Theorem (Hadamard-Gerschgorin).

    The eigenvalues of the complex matrix
    A = [a_{ij}] lie inside the closed domain
    G consisting of all circles K_i (i = 1,...,n)
    with centers a_{ii} and radii r_i where

        r_i = \sum_{j \ne i} |a_{ij}|

If we apply the H-G theorem to a real symmetric matrix and
require that a_{ii} \ge \sum_{j \ne i} |a_{ij}| for all i, then we can get a
sufficient condition test for positive semi-definiteness.
Lemma 1. Let A be a real symmetric matrix. If

    a_{ii} \ge \sum_{j \ne i} |a_{ij}|    for all i,

then A is positive semi-definite.

The proof is a straightforward application of the H-G theorem.
Since A is real symmetric its eigenvalues (\lambda(A)) must be real.
Then applying the H-G theorem, the eigenvalues of A must lie between
\min_i (a_{ii} - r_i) and \max_i (a_{ii} + r_i), where r_i = \sum_{j \ne i} |a_{ij}|, as
illustrated in Figure 3a. But the hypothesis requires

    a_{ii} \ge \sum_{j \ne i} |a_{ij}| = r_i

implying

    a_{ii} - r_i \ge 0    for all i.

In particular, \min_i (a_{ii} - r_i) \ge 0,
which implies that the eigenvalues of A must be non-negative. Hence,
A must be positive semi-definite and the lemma is proved.
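
Lemma 1 translates directly into a row-by-row test. The following sketch is one possible implementation (the function names are illustrative); it also shows, with a second matrix, that the test is sufficient but not necessary.

```python
import numpy as np

def gershgorin_margins(A):
    """Return a_ii - sum_{j != i} |a_ij| for each row of a real symmetric A."""
    A = np.asarray(A, dtype=float)
    radii = np.abs(A).sum(axis=1) - np.abs(np.diag(A))
    return np.diag(A) - radii

def passes_lemma1(A):
    """Sufficient (not necessary) test for positive semi-definiteness."""
    A = np.asarray(A, dtype=float)
    return bool(np.allclose(A, A.T) and np.all(gershgorin_margins(A) >= 0.0))

# Diagonally dominant: passes the test, and is indeed p.s.d.
A1 = np.array([[2.0, -1.0], [-1.0, 2.0]])
# Positive definite, but not diagonally dominant: the test is inconclusive.
A2 = np.array([[1.0, 0.9, 0.9], [0.9, 1.0, 0.9], [0.9, 0.9, 1.0]])

print(passes_lemma1(A1), np.linalg.eigvalsh(A1).min())
print(passes_lemma1(A2), np.linalg.eigvalsh(A2).min())
```

The second matrix illustrates why constraints derived from the lemma can be conservative: a gain may give a stable filter even when the row test fails.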
Two lemmas will now be proven which will provide a method by 


which we may determine the best bounding matrix D. 


Lemma 2. Let:
    a. M(t) be a real symmetric matrix of elements m_{ij}(t)
    b. \alpha_i(t) = m_{ii}(t) - \sum_{j \ne i} |m_{ij}(t)|
    c. \alpha_i(t) \ge 0 for all i and t
Then M(t) is positive semi-definite for all t.
The proof is immediate from Lemma 1. Figure 3b illustrates
the \alpha_i(t).


FIGURE 3a

Hadamard-Gerschgorin Theorem applied to
a real symmetric matrix (all eigenvalues \lambda
must lie in the interval shown on the real axis)

FIGURE 3b

Illustration of the \alpha_i(t) of Lemma 2
As a result of Lemma 2, if from equation (4.31) we set

    M(t) = H'G'(t)Q + QG(t)H    (4.34)

then

    M(t) \ge D - C    for all t    (4.35)

If \lim_{t \to \infty} \alpha_i(t) = 0 for any i, then the condition for stability
is obtained by choosing D = C. However, if \alpha_i(t) > 0 for all i and t,
then we should be able to choose some D > C, giving rise to a tighter
restriction on G(t).


Lemma 3. In Lemma 2 let:
    a. \alpha_k(t) = \min_i \alpha_i(t)
    b. \alpha_k(t_1) = \min_t \alpha_k(t) > 0
    c. m_{ii}(t) - m_{ii}(t_1) \ge \sum_{j \ne i} |m_{ij}(t) - m_{ij}(t_1)|
Then M(t) \ge M(t_1) with equality only at t = t_1.

The proof of Lemma 3 is by construction. Choose t_1 as specified
in conditions (a) and (b), thus insuring that the smallest
eigenvalue of M(t) will be positive for all t. For the matrix (M(t) - M(t_1)),
use (b) in Lemma 2 to compute condition (c) in Lemma 3, and the
lemma is proved.

It is apparent that the conditions in Lemma 3 may be used to
insure that equation (4.33) holds. That is, by choosing M(t) as
in (4.34) and the matrix M(t_1) as in Lemma 3, we may rewrite
equation (4.33) as an inequality since Lemma 3 only specifies a
boundary for the eigenvalues of M(t). The result is

    \min_t \lambda_{\min}(M(t) - D + C) \ge 0    (4.36)

then

    \lambda_{\min}(M(t_1) - D + C) \ge 0    (4.36a)

Now if Lemma 3 applies, then we are assured that (4.36a) is true
for some choice of the matrix D. However, we may also apply Lemma 3
directly to (4.31). This results in

    M(t) \ge D - C    (4.37)

Since C is given, D may be computed directly by replacing the
left side of (4.37) with its minimum, M(t_1), to establish the equality

    M(t_1) = D - C    (4.38)

This follows since, by Lemma 3, M(t) \ge M(t_1).

On the other hand, if D is given, as established by some
convergence bound \eta_1, then the conditions in Lemma 3 may be applied to
(4.37), thus establishing constraining relations on the elements of
M(t).

We are really interested in the matrix G(t). From (4.34) it is
obvious that every element of M(t) will in general be a linear
combination of all the elements of G(t). This follows since H and Q
are determined by the signal model and are constant matrices. Thus,
in general

    m_{ij}(t) = \sum_{k,l} b_{kl} g_{kl}(t)    (4.39)

This may not be difficult to apply to specific examples since H and
Q are often very simple matrices and many of the b_{kl} = 0.
The following illustration refers to Example 4, page 35. 
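Example 6. As in Example 4, consider the system described by
the transition matrix

    A(t) = \begin{bmatrix} -(1 + \frac{1}{t}) & 0 \\ 0 & -(2 + \frac{1}{t}) \end{bmatrix}

This is equivalent to a filtering problem
where the signal model is described as
follows:

    The system:       \dot{x} = \begin{bmatrix} -1 & 0 \\ 0 & -2 \end{bmatrix} x + \begin{bmatrix} w_1 \\ w_2 \end{bmatrix}

    and observation   z = \begin{bmatrix} 1 & 0 \\ 0 & 1 \end{bmatrix} x + \begin{bmatrix} v_1 \\ v_2 \end{bmatrix}

where

    A = \begin{bmatrix} -1 & 0 \\ 0 & -2 \end{bmatrix},    H = I

and w_1, w_2, v_1, v_2 are white Gaussian noise
inputs. The homogeneous filter dynamics are
given in (4.8) as

    \dot{e} = (A - G(t)H)e

where

    G(t) = \begin{bmatrix} \frac{1}{t} & 0 \\ 0 & \frac{1}{t} \end{bmatrix}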


By using Lemma 2 we will derive the Lyapunov
function given in equation (3.19) of Example 4.

First, assume a Lyapunov function of the form

    V = e'Qe

then by equation (4.15)

    \dot{V} = -e'Ce - e'(H'G'(t)Q + QG(t)H)e \le -e'De

where Q is determined from -C = A'Q + QA. But in this case AA' = A'A
and we may use Theorem 3.3, page 25 to find Q and C. The result is

    Q = I,    C = \begin{bmatrix} 2 & 0 \\ 0 & 4 \end{bmatrix}

Now in order to apply Lemma 2 we find from equation (4.34)

    M(t) = G'(t) + G(t) = \begin{bmatrix} \frac{2}{t} & 0 \\ 0 & \frac{2}{t} \end{bmatrix}

This follows since Q = H = I.
Then by Lemma 2 we must have

    \alpha_1(t) = \alpha_2(t) = \frac{2}{t} \ge 0    for all t.

Obviously, \alpha_i(t) \ge 0 for any t \ge t_0 > 0. Since \alpha_i(t) = 0 in the
limit as t approaches infinity, the condition for stability is
obtained by choosing D = C. That is, -e'Ce is a suitable bounding
function for \dot{V}. Hence, \dot{V} is negative definite and the filter is
ASIL by Theorem 3.1, page 21.
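
The reasoning of Example 6 can be traced numerically. This sketch builds M(t) from (4.34), evaluates the row margins \alpha_i(t) of Lemma 2 (taken here as m_ii minus the sum of the off-diagonal magnitudes, which is an assumed reading of condition (b)), and confirms that \dot{V} = -e'(C + M(t))e stays bounded by -e'Ce on a time grid chosen for illustration.

```python
import numpy as np

A = np.diag([-1.0, -2.0])
H = np.eye(2)
Q = np.eye(2)
C = -(A.T @ Q + Q @ A)                 # [[2, 0], [0, 4]]

def G(t):
    """Gain of Example 6: G(t) = (1/t) I."""
    return np.eye(2) / t

def M(t):
    """M(t) = H'G'(t)Q + QG(t)H from (4.34)."""
    return H.T @ G(t).T @ Q + Q @ G(t) @ H

def alphas(Mt):
    """alpha_i = m_ii - sum_{j != i} |m_ij| (margins used in Lemma 2)."""
    return np.diag(Mt) - (np.abs(Mt).sum(axis=1) - np.abs(np.diag(Mt)))

times = np.linspace(0.5, 50.0, 100)
min_alpha = min(alphas(M(t)).min() for t in times)
# dV/dt = -e'(C + M(t))e, so the largest eigenvalue of -(C + M(t))
# should never exceed -lambda_min(C) = -2.
worst_rate = max(np.linalg.eigvalsh(-(C + M(t))).max() for t in times)
print(min_alpha, worst_rate)
```

The margins shrink like 2/t but remain positive on any finite interval, which matches the choice D = C as the bounding matrix.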

Lemma 3 applies to problems where the a(t) in Lemma 2 are greater 


than zero for all t. Consider the slight extension of the above exam- 


ple by letting 
Take t_1 in the limit as t approaches infinity. Then from conditions
(a) and (b) in Lemma 3:
okt ) a (t 


It follows that 
2 0 


My? ya 


and condition (c) of Lemma 3 reduces trivially to Lemma 2. Using equation
(4.38) we may compute the matrix D:

    D = M(t_1) + C


By using the results of Section III G we may compute a convergence 
bound for the filter employing the given G(t). 

The problem of using the foregoing Lemmas to obtain constraints on 
the elements of G(t) giving rise to a class of suboptimal filters is 
deferred to Chapter VI. 

The foregoing Lemmas apply to discrete systems simply by replacing
t with n. Now M(n) is specified from equation (4.18) in Section IV B.
Namely:

    M(n) = \Phi'QG(n)H\Phi + \Phi'H'G'(n)Q\Phi - \Phi'H'G'(n)QG(n)H\Phi    (4.40)

However, (4.40) may be simplified. This follows since \Phi is
nonsingular and appears as a congruent transformation on the enclosed
matrix and does not affect the sign definiteness of the enclosed matrix
[ref. B2]. So in effect we may take

    M(n) = QG(n)H + H'G'(n)Q - H'G'(n)QG(n)H    (4.41)


D. CONCLUSIONS 

In this section we studied the homogeneous dynamics of the basic 
filter-observer equation (4.5) and its error dynamics equation (4.8). 
Since these homogeneous dynamics are identical, the results of this 
section apply to both. 

The basic approach in this section is to assume a time invariant 
Lyapunov function and compute its time derivative, V, subsequently 
showing that V is negative definite for a given filter gain matrix 
G(t). This was found difficult to do in general terms. Often it 
is possible to determine the quadratic form for V based solely 
upon the signal model. This requires that the model be ASIL. 

It is useful to choose the Lyapunov function based only upon the
signal model if one wishes to compare the convergence of two
different filters designed for the same signal model by using the
estimation techniques described in section III G. Under these
conditions the Lyapunov function for the two filters is the same,
but the associated \dot{V} is modified according to the particular filter
(see equation (4.15)).

The techniques developed in this chapter cannot be readily 
applied to the optimal filter. This is left as an area for addi- 
tional research. 

Several constraining relations between the bounding function \gamma
required for stability and characteristics of the gain matrix were
developed in section IV C. In general these relations are difficult
to apply, but in some simple cases they lead to a method of
determining G(t) to insure a stable filter which converges faster than
some desired bound.
V. THE FORCED FILTER EQUATION


A. INTRODUCTION 
In this chapter the forcing functions are included in the
filter dynamics. From the study of linear forced systems in
section III F, recall that the forcing functions do not affect
the asymptotic stability of the system dynamics. They do, however,
determine the steady state properties of the system. Such is also
the case for the filter-observer system, and these properties
are investigated in this chapter.
The error dynamics for the filter given in equation (4.8)
are repeated here

    \dot{e} = (A - G(t)H)e - G(t)v(t) + w(t)    (5.1)

As in chapter IV, assume a Lyapunov function of the form

    V = e'Qe    (5.2)

then by direct computation using (5.1)

    \dot{V} = \dot{e}'Qe + e'Q\dot{e}
            = e'(A'Q + QA)e - e'(H'G'(t)Q + QG(t)H)e
              + 2w'(t)Qe - 2v'(t)G'(t)Qe

    \dot{V} = -e'Ce - e'(H'G'(t)Q + QG(t)H)e
              + 2w'(t)Qe - 2v'(t)G'(t)Qe    (5.3)

where Q is determined, as before, for any given p.d. matrix C by 
the signal model which is required to be ASIL. 

It should be noted that the only difference between equation 
(5.3) and equation (4.15) is the addition of two terms involving 
the forcing functions w(t) and v(t). However, this particular 
application is different because the forcing functions are random 
vectors. To consider this aspect we use the expectation operator 
and rewrite equation (5.3). Since Q is completely determined for 
any choice of positive definite C, we write this equation in terms 
of Q. Applying the matrix identity (5.4) to (5.3) 
$$x'My = \mathrm{tr}[Myx'] = \mathrm{tr}[yx'M] \tag{5.4}$$
results in 
$$\dot{V} = \mathrm{tr}[\,Q(A-G(t)H)ee' + ee'(A-G(t)H)'Q + 2Qew'(t) - 2ev'(t)G'(t)Q\,] \tag{5.5}$$
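
The trace identity (5.4) is what allows each scalar term of (5.3) to be moved under a single trace. A small numerical spot check of the identity is sketched below; the dimension, the random seed and the variable names are arbitrary choices made for illustration and are not part of the original development.

```python
import numpy as np

rng = np.random.default_rng(0)   # fixed seed, arbitrary choice
n = 4                            # illustrative dimension

x = rng.standard_normal(n)
y = rng.standard_normal(n)
M = rng.standard_normal((n, n))

quad = x @ M @ y                        # x' M y
tr1 = np.trace(M @ np.outer(y, x))      # tr[M y x']
tr2 = np.trace(np.outer(y, x) @ M)      # tr[y x' M]

print(quad, tr1, tr2)                   # the three values should agree
```

The check covers one random draw only; it illustrates the identity rather than proving it.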


In order to remove the random nature from this problem we take 
the expectation of V and $\dot{V}$. The forcing functions w(t) and v(t) 
are uncorrelated white Gaussian random processes. Consequently, 
the estimate error is also a Gaussian random process. We define 
the error covariance matrix $P = E[ee']$. Thus the Lyapunov 
function (5.2) becomes 
$$E[V] = \mathrm{tr}\,E[Qee'] = \mathrm{tr}(QP) \tag{5.6}$$
The expectation operation is done over an ensemble of Lyapunov 
functions each of which is the result of one experiment. Taking 
the expectation and applying (5.4) to (5.5) results in 
$$E[\dot{V}] = \mathrm{tr}[\,Q(A-G(t)H)P + P(A-G(t)H)'Q + 2Q\,E[ew'(t)] - 2E[ev'(t)]\,G'(t)Q\,] \tag{5.7}$$
The expectation operator is defined as $E[x] = \int_{-\infty}^{\infty} x f(x)\,dx$, 
where f(x) is the density function of x [ref. Papoulis, P1]. 
To finish this problem we must first determine what $E[ew']$ 
and $E[ev']$ are. Equation (5.1) has a solution of the form 
$$e(t) = \Phi(t,t_0)\,e(t_0) + \int_{t_0}^{t}\Phi(t,\tau)\,w(\tau)\,d\tau - \int_{t_0}^{t}\Phi(t,\tau)\,G(\tau)\,v(\tau)\,d\tau \tag{5.8}$$
But w(t), v(t) and $e(t_0)$ are uncorrelated so that 
$$E[wv'] = E[vw'] = E[e(t_0)w'] = E[e(t_0)v'] = 0 \tag{5.9}$$
Also define 
$$E[w(t)w'(\tau)] = W\,\delta(t-\tau)$$
$$E[v(t)v'(\tau)] = R\,\delta(t-\tau) \tag{5.10}$$
where $\delta(t-\tau)$ is the Dirac delta function, zero for $t \neq \tau$ 
and with unit integral. 
Now using (5.8), form $ew'$. 
$$e(t)w'(t) = \Phi(t,t_0)\,e(t_0)\,w'(t) + \int_{t_0}^{t}\Phi(t,\tau)\,w(\tau)\,w'(t)\,d\tau - \int_{t_0}^{t}\Phi(t,\tau)\,G(\tau)\,v(\tau)\,w'(t)\,d\tau \tag{5.11}$$
Taking the expectation of (5.11), interchanging the order of 
integration and expectation and using (5.9), (5.10) 
$$E[e(t)w'(t)] = 0 + \int_{t_0}^{t}\Phi(t,\tau)\,W\,\delta(t-\tau)\,d\tau - 0 = \tfrac{1}{2}\,W \tag{5.12}$$
This follows from the sifting property of the Delta function 
applied at the upper limit of integration, together with 
$\Phi(t,t) = I$. Similarly we can find: 
$$E[e(t)v'(t)] = -\tfrac{1}{2}\,G(t)R \tag{5.13}$$


The Delta function sifting property applied to an integration 
limit at the location of the Delta function is used here: 
$$\int_a^b f(x)\,\delta(x-b)\,dx = \tfrac{1}{2}\,f(b)$$
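
The factor of one half in this end-point sifting property can be illustrated numerically by replacing the Delta function with a narrow symmetric pulse. The sketch below assumes a Gaussian pulse of width `eps` as the stand-in; the function name `half_sift`, the pulse width and the test function are illustrative choices, not part of the original text.

```python
import numpy as np

def half_sift(f, a, b, eps=1e-3, num=200001):
    """Integrate f(x) * delta_eps(x - b) over [a, b].

    delta_eps is a narrow Gaussian of standard deviation eps, used here
    as an assumed approximation of the Delta function.
    """
    x = np.linspace(a, b, num)
    delta = np.exp(-0.5 * ((x - b) / eps) ** 2) / (eps * np.sqrt(2.0 * np.pi))
    y = f(x) * delta
    dx = x[1] - x[0]
    # Trapezoidal rule written out explicitly.
    return dx * (y.sum() - 0.5 * (y[0] + y[-1]))

value = half_sift(np.cos, 0.0, 1.0)
print(value, 0.5 * np.cos(1.0))   # the two numbers should be close
```

Because only half of the symmetric pulse lies inside the interval, the integral picks up approximately f(b)/2 rather than f(b).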
a 




Substituting (5.12) and (5.13) into (5.7) yields 
$$E[\dot{V}] = \mathrm{tr}[\,Q(A-G(t)H)P + P(A-G(t)H)'Q + QW + G(t)RG'(t)Q\,] \tag{5.14}$$

Note that if Q = I in (5.14) the result becomes the error 
dynamics developed by others. Specifically Athans and Tse [A1] 
arrived at (5.14) in their derivation of the optimal filter and 
Sims and Melsa [S2] also arrived at this result for specific optimal 
filters. 

For discrete systems a difference equation for E[V(n)] analogous 
to equation (5.14) is easily obtained by using equation (4.13) 
to form 
$$E[e'Qe] = \mathrm{tr}[QP(n)] \tag{5.15}$$
as in the continuous case. The result for P(n) is 
$$P(n) = (I-G(n)H)\,P'(n)\,(I-G(n)H)' + G(n)RG'(n) \tag{5.16}$$
$$P'(n) = \Phi P(n-1)\Phi' + W$$
where R is the covariance matrix for the measurement noise and W 
is for the noise driving the signal model. Here P'(n) denotes the 
covariance propagated from the previous sample; on the other 
matrices the prime denotes the transpose. A derivation of equation 
(5.16) also appears in Sorenson. 
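
The recursion (5.16) and the scalar measure (5.15) translate directly into a short loop. The sketch below is only an illustration: the two-state matrices `Phi`, `W`, `H`, `R`, the constant gain `G`, the choice Q = I and the function names are all hypothetical values picked so the example runs, and none of them come from the thesis.

```python
import numpy as np

def covariance_step(P_prev, Phi, W, H, R, G):
    """One step of (5.16): propagate, then correct with an arbitrary gain G."""
    P_pred = Phi @ P_prev @ Phi.T + W                # P'(n)
    I_GH = np.eye(P_prev.shape[0]) - G @ H
    return I_GH @ P_pred @ I_GH.T + G @ R @ G.T      # P(n)

def lyapunov_cost(Q, P):
    """E[V(n)] = tr(Q P(n)), equation (5.15)."""
    return float(np.trace(Q @ P))

# Hypothetical two-state example (assumed values).
Phi = np.array([[0.9, 0.1], [0.0, 0.8]])
W = np.diag([0.0, 0.01])
H = np.array([[1.0, 0.0]])
R = np.array([[0.1]])
G = np.array([[0.5], [0.1]])
Q = np.eye(2)

P = np.eye(2)          # assumed initial covariance
costs = []
for n in range(50):
    P = covariance_step(P, Phi, W, H, R, G)
    costs.append(lyapunov_cost(Q, P))

print(costs[0], costs[-1])
```

Because (5.16) holds for any gain, the same loop can be run with several gain sequences and the resulting `costs` lists compared on one plot.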


B. DERIVATION OF THE OPTIMUM G(t) 

In effect the stochastic Lyapunov function (5.6) for the filter- 
observer is a linear combination of all the covariances of error 
between the estimates (outputs of the filter) and the actual signal 
being estimated. The covariance of error gives a measure of how 
well the filter should be expected to perform. As such, the Lyapunov 
function (5.6) gives a scalar measure of the performance of the 
filter. It is desirable that the covariance of error be minimum 
for all t. Thus the Lyapunov function (5.6) may be used as a cost 
criterion to be minimized. The optimum filter is then formed by 
choosing G(t) such that E[V] is minimized for all t. If (5.14) 
is negative for all $t > t_0$, then 
$$E[\dot{V}] < 0 \tag{5.17}$$
and the filter is ASIL since E[V] > 0 for all t. Moreover, if 
$E[\dot{V}]$ is negative, then E[V] is assured to be a minimum if the 
magnitude of its time derivative is maximum for all t. The optimum 
G(t) can be derived as follows, using the concept of a gradient 
matrix. The following formulas for the partial derivative of a 
scalar function (tr) of several matrices with respect to one of 
the matrices can be proven (Athans and Tse [A1]). 


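$$\frac{\partial}{\partial B}\,\mathrm{tr}(AB') = \frac{\partial}{\partial B}\,\mathrm{tr}(BA') = A \tag{5.18}$$
$$\frac{\partial}{\partial A}\,\mathrm{tr}(ACA') = 2AC \tag{5.19}$$
where C is symmetric. 
Taking the partial derivative of (5.14) with respect to G(t) 
using (5.18) and (5.19) yields 
$$\frac{\partial}{\partial G}\,E[\dot{V}] = 0 = -2PH'Q + 2G(t)RQ$$
and solving for G(t) 
$$G(t) = PH'R^{-1} \tag{5.20}$$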
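
A quick numerical experiment can make the stationary point (5.20) more tangible: evaluate the right-hand side of (5.14) at $G = PH'R^{-1}$ and at randomly perturbed gains, and compare. In the sketch below the matrices `A`, `H`, `P`, `Q`, `W`, `R`, the perturbation size and the function name `ev_dot` are hypothetical choices made for illustration; the experiment samples a few perturbations only and does not by itself establish conditions (5.17) or (5.22).

```python
import numpy as np

def ev_dot(G, A, H, P, Q, W, R):
    """Right-hand side of (5.14) for a given gain G."""
    F = A - G @ H
    return float(np.trace(Q @ F @ P + P @ F.T @ Q + Q @ W + G @ R @ G.T @ Q))

rng = np.random.default_rng(1)    # fixed seed, arbitrary choice
n, m = 3, 2                       # assumed state and measurement dimensions

A = np.array([[0.0, 1.0, 0.0],
              [0.0, 0.0, 1.0],
              [-1.0, -3.0, -3.0]])                       # hypothetical stable A
H = rng.standard_normal((m, n))
L = rng.standard_normal((n, n)); P = L @ L.T + np.eye(n)  # p.d. covariance
S = rng.standard_normal((n, n)); Q = S @ S.T + np.eye(n)  # p.d. Lyapunov matrix
W = 0.1 * np.eye(n)
R = np.diag([0.5, 2.0])

G_opt = P @ H.T @ np.linalg.inv(R)                        # equation (5.20)
base = ev_dot(G_opt, A, H, P, Q, W, R)

# Evaluate (5.14) at randomly perturbed gains.
perturbed = [ev_dot(G_opt + 0.1 * rng.standard_normal((n, m)), A, H, P, Q, W, R)
             for _ in range(20)]

print(base, min(perturbed))
```

Every perturbed gain should give a value no smaller than `base`, consistent with (5.20) making $E[\dot{V}]$ as negative as possible for the given P.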


This is the identical result derived by Kalman and others 
[A1, K3, S3] for the optimal filter. To complete the derivation 
of the optimal filter we must show that (5.20) results in a minimum 
for (5.14). The second partial derivative of (5.14) with 
respect to G, obtained by applying (5.18) and the fact that R and Q 
are symmetric, is 
$$\frac{\partial^2}{\partial G^2}\,E[\dot{V}] = 2QR \tag{5.21}$$
If 
$$QR = RQ \tag{5.22}$$
that is, if R and Q commute, then the right hand side of (5.21) is 
positive definite since Q and R are positive definite. This follows 
from the result indicated in Bellman [B1] given on page 33 of 
chapter III. Now if (5.21) is positive definite then (5.20) does 
result in a minimum for (5.14). But if (5.17) holds, then the 
magnitude of $E[\dot{V}]$ is maximum (for G in (5.20)), hence E[V] in 
(5.6) is a minimum. Moreover, the ratio 
$$\frac{-E[\dot{V}]}{E[V]} \tag{5.23}$$
is also maximum, and the following important conclusion is reached. 
(1) The filter using the G(t) given 
by (5.20), is assured of having 
the most rapid convergence rate 
to the minimum covariance of error. 
This follows from the discussion on page 36 of chapter III, section G. 
The foregoing derivation strictly holds only if conditions 
(5.17) and (5.22) are true. However, the result indicated in 
(1) above does give engineering insight into the nature of optimal 
filters and to desired properties of suboptimal filters. That is, 
it is possible to make an engineering trade-off between convergence 
rate and steady state covariance of error in the design of suboptimal 
filters which are much easier to implement than the Kalman filter. 


C. CONCLUSION 
The concept of a Lyapunov function for random variables was 
introduced for the forced filter dynamics. A derivation of the 
optimal filter gain matrix was given leading to the conclusion that 
the optimal filter is the most rapid in converging to the minimum 
covariance of error. As a consequence, it is possible in the design 
of suboptimal filters to make an engineering trade-off between 
convergence rate and steady state estimation error. 
VI. AN EXAMPLE OF A SUBOPTIMAL FILTER 


A. INTRODUCTION 
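Consider the discrete form of a system described in phase variable 
form with only one state being observed. Namely: 
$$\dot{x} = Ax, \qquad y = Hx \tag{6.1}$$
where 
$$A = \begin{bmatrix} 0 & 1 & 0 & \cdots & 0 \\ 0 & 0 & 1 & \cdots & 0 \\ \vdots & & & \ddots & \vdots \\ 0 & 0 & 0 & \cdots & 1 \\ -a_1 & -a_2 & -a_3 & \cdots & -a_n \end{bmatrix}, \qquad H = \begin{bmatrix} 1 & 0 & \cdots & 0 \end{bmatrix}$$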

It is assumed that A is stable and H is a 1 x n matrix, and 
that $\Phi(T)$ has been found as described in Section IIA where T is 
the sampling period. Note the transition matrix for the discrete 
filter (4.12) 
$$\hat{x}(n+1) = (\Phi(T) - G(n)H\Phi(T))\,\hat{x}(n) + G(n)\,z(n)$$
If H is 1 x n, then G(n) must be n x 1 in order for the indicated 
multiplication to be conformable. This is indeed fortunate, since G(n) 
is chosen to ensure that the filter is stable by application of the 
lemmas in Section IV C, and these lemmas only give n constraining 
relations for the elements of G. 

Proceeding, we determine the system Lyapunov function $V = e'Qe$ 
for $\Delta V = -e'Ce$ by finding Q such that $\Phi'Q\Phi - Q = -C$ for a given 
positive definite C. This may be easily done using the algorithm 
developed in Appendix B. 

Now, in order to ensure stability, the matrix M(n) in (4.41) must 
be positive semi-definite in accordance with Lemmas 1 and 2 developed in 
Section IV C. The argument, (n), will be dropped for convenience 
in the following and the notation used for Q is: 
$$Q = (b_{ij}) \quad \text{and for} \quad G = (g_i), \qquad i, j = 1, 2, \ldots, n$$


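Repeating equation (4.41) here 
$$M = QGH + H'G'Q - H'G'QGH \tag{6.2}$$
Multiplying (6.2) out, noting that: 
$$G' = (g_1 \;\; g_2 \;\; \cdots \;\; g_n), \qquad GH = [\,G \;\; 0\,]_{n \times n}$$
$$QGH = \begin{bmatrix} \sum_{j=1}^{n} b_{1j}g_j & 0 & \cdots & 0 \\ \vdots & \vdots & & \vdots \\ \sum_{j=1}^{n} b_{nj}g_j & 0 & \cdots & 0 \end{bmatrix}_{n \times n} \tag{6.3}$$
$G'QG$ is in fact a quadratic form in the vector G and 
$$H'H = \begin{bmatrix} 1 & 0 & \cdots & 0 \\ 0 & 0 & \cdots & 0 \\ \vdots & & \ddots & \vdots \\ 0 & 0 & \cdots & 0 \end{bmatrix}$$
then 
$$H'G'QGH = \Big(\sum_{i}\sum_{j} b_{ij}\,g_i\,g_j\Big)\,H'H \tag{6.4}$$
Substituting the results, (6.3) through (6.4), into (6.2) 
$$M = \begin{bmatrix} 2\sum_{j} b_{1j}g_j - \sum_{i}\sum_{j} b_{ij}g_ig_j & \sum_{j} b_{2j}g_j & \cdots & \sum_{j} b_{nj}g_j \\ \sum_{j} b_{2j}g_j & 0 & \cdots & 0 \\ \vdots & \vdots & & \vdots \\ \sum_{j} b_{nj}g_j & 0 & \cdots & 0 \end{bmatrix} \tag{6.5}$$
Now the $\alpha_i$ of Lemma 2 are: 
$$\alpha_1 = m_{11} - \sum_{i=2}^{n}\Big|\sum_{j=1}^{n} b_{ij}g_j\Big|, \qquad \alpha_i = -\Big|\sum_{j=1}^{n} b_{ij}g_j\Big|, \quad i = 2, \ldots, n$$
where $m_{11}$ is the (1,1) element of M in (6.5). 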








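According to Lemma 2, $\alpha_i \geq 0$; however, it is evident that for 
$i \geq 2$ we must choose $\alpha_i = 0$ since they can never be positive. This 
results in n - 1 linear equations in n unknowns depicted by: 
$$\sum_{j=1}^{n} b_{ij}\,g_j = 0, \qquad i = 2, \ldots, n \tag{6.6}$$
Choosing to solve these n - 1 equations for $g_2, \ldots, g_n$ in terms of 
$g_1$, rewrite equations (6.6) in matrix form 
$$\begin{bmatrix} b_{22} & b_{23} & \cdots & b_{2n} \\ b_{32} & b_{33} & \cdots & b_{3n} \\ \vdots & & & \vdots \\ b_{n2} & b_{n3} & \cdots & b_{nn} \end{bmatrix} \begin{bmatrix} g_2 \\ g_3 \\ \vdots \\ g_n \end{bmatrix} = \begin{bmatrix} -b_{21} \\ -b_{31} \\ \vdots \\ -b_{n1} \end{bmatrix} g_1 \tag{6.7}$$
The square matrix in (6.7) must have an inverse, since it is a 
principal minor of Q which is positive definite. Thus, (6.7) may in 
fact be solved for $g_2, \ldots, g_n$ in terms of $g_1$. The remaining relation 
necessary to determine $g_1$ is given by $\alpha_1$ of Lemma 2. With (6.6) 
satisfied, 
$$\alpha_1 = 2\sum_{j=1}^{n} b_{1j}\,g_j - \sum_{i=1}^{n}\sum_{j=1}^{n} b_{ij}\,g_i\,g_j \tag{6.8}$$
But $\alpha_1$ must be non-negative. Since the quadratic form $G'QG$ is 
itself non-negative, this is assured by the sufficient condition: 
$$\sum_{j=1}^{n} b_{1j}\,g_j \;\geq\; \sum_{i=1}^{n}\sum_{j=1}^{n} b_{ij}\,g_i\,g_j \tag{6.9}$$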


The determinants of the principal minors of a positive definite 
matrix are positive [ref. B2]. 


Using equation (6.7) and relation (6.9) a family of G matrices may 
be determined. Since some $\alpha_i$ is zero, the stability condition as 
discussed in Section IV C following Lemma 2 is obtained by choosing 
D = C. Referring to Section III G the convergence bound for this filter 
is 
$$\lambda_{\min}(DQ^{-1}) = \lambda_{\min}(CQ^{-1}) \tag{6.10}$$


But (6.10) is the convergence bound for the signal model since C 
was chosen for the signal model. This means that the transient behavior 
of the filter is assured to be no worse than that of the signal model. 

Unfortunately, the foregoing Lyapunov technique cannot be 
used directly for optimal filters involving the phase variable form of 
signal model. In particular, $\Delta V$, equation (4.18), has not been shown 
to be negative definite for the optimal filter in this example. However, 
a reasonably good comparison of the suboptimal filter design and 
the optimal filter may be made by computing tr(QP(n)), equation (5.15), 
for both filters using the Q determined from the signal model. This 
follows because the Lyapunov function (5.15) is a scalar function of all 
the covariance of error for the filter. 


B. NUMERICAL EXAMPLE 

In order to summarize the foregoing results, a simple third order 
numerical example is given. 

A suboptimal filter-observer is to be designed for the given signal 
model. The filter is required to be ASIL. 

The signal model is described as follows. Given the signal dynamics 
described by the continuous time matrix in phase variable form 
     0 1 0 
A=] 0 0 1 (6.11) 
aioe =| slic 
the discrete version of this signal model for a sample period of T = .15 
is driven by a white Gaussian random sequence whose covariance matrix is 
given as 
$$W = \begin{bmatrix} 0 & 0 & 0 \\ 0 & 0 & 0 \\ 0 & 0 & 0.0086 \end{bmatrix} \tag{6.12}$$
The corresponding discrete system for (6.11) was found for T = .15 
by using a digital computer. For this, a common NPS subroutine called 
"PHIDEL" was used which essentially computes the series for $e^{AT}$. This 
resulted in: 





.994 A eee 0965 
OS ia coal al O91 884 = 0653 (6.13) 


= 1O7ome 4 - 1aC07 101 
Only one state of the signal model is observed and 
$$H = (1 \;\; 0 \;\; 0) \tag{6.14}$$


The observations are contaminated by additive white noise whose 


covariance matrix is 
R =] 0 i 0 (Cais) 
This completes the description of the signal model. 


The first step in the design of a suboptimal filter is to find a 
Lyapunov function for the signal model. 
Assuming that C = I the Q matrix of the Lyapunov function for 
this system was found using the algorithm developed in Appendix B. For 
this problem the algorithm converged in 42 iterations to a stopping 
criterion of 
$$\max_{i,j}\,|d_{ij}(k)| < .0005 \tag{6.16}$$
where $d_{ij}(k)$ is an element of the kth correction matrix. See 
Appendix B for further explanation. 


The resulting Q is: 
$$Q = \begin{bmatrix} 13.9 & 7.56 & 0.357 \\ 7.56 & 13.6 & 0.789 \\ 0.357 & 0.789 & 1.08 \end{bmatrix} \tag{6.17}$$
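
With Q from (6.17) and C = I, the quantity $\lambda_{\min}(CQ^{-1})$ appearing in (6.10) can be evaluated numerically. The sketch below uses only the three-figure entries printed in (6.17); the variable names are illustrative, and how the resulting number enters the convergence bound is as described in Section III G rather than in this sketch.

```python
import numpy as np

# Q as printed in (6.17), rounded to three significant figures.
Q = np.array([[13.9, 7.56, 0.357],
              [7.56, 13.6, 0.789],
              [0.357, 0.789, 1.08]])
C = np.eye(3)                      # C = I, as assumed in the text

# Eigenvalues of C Q^{-1}; the smallest one is the quantity in (6.10).
eigs = np.linalg.eigvals(C @ np.linalg.inv(Q))
bound = float(np.min(eigs.real))

print(bound)
```

For C = I this is simply the reciprocal of the largest eigenvalue of Q, which gives an independent way to check the result.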


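In order to design the filter, G must have the n x 1 form 
$$G = \begin{bmatrix} g_1 \\ g_2 \\ g_3 \end{bmatrix} \tag{6.18}$$
and equation (6.7) must be solved. Substituting from (6.17) into 
(6.7) and solving for $g_2$ and $g_3$ results in 
$$\begin{bmatrix} 13.6 & 0.789 \\ 0.789 & 1.08 \end{bmatrix} \begin{bmatrix} g_2 \\ g_3 \end{bmatrix} = \begin{bmatrix} -7.56 \\ -0.357 \end{bmatrix} g_1 \tag{6.19}$$
$$\begin{bmatrix} g_2 \\ g_3 \end{bmatrix} = \begin{bmatrix} 0.0768 & -0.0561 \\ -0.0561 & 0.968 \end{bmatrix} \begin{bmatrix} -7.56 \\ -0.357 \end{bmatrix} g_1$$
$$g_2 = -0.56\,g_1, \qquad g_3 = 0.055\,g_1 \tag{6.20}$$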
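
The linear system (6.19) is small enough to solve by hand, but the same step can be sketched for any n by partitioning Q as in (6.7). The code below uses the rounded Q printed in (6.17); the helper name `gain` and the variable names are illustrative assumptions, not routines from the thesis.

```python
import numpy as np

# Q as printed in (6.17), rounded to three significant figures.
Q = np.array([[13.9, 7.56, 0.357],
              [7.56, 13.6, 0.789],
              [0.357, 0.789, 1.08]])

# (6.7): rows 2..n of Q G must vanish; solve for g2..gn per unit g1.
coeff = np.linalg.solve(Q[1:, 1:], -Q[1:, 0])    # [g2/g1, g3/g1]

def gain(g1):
    """Gain vector G (n x 1) satisfying (6.6) for a chosen g1."""
    return np.concatenate(([g1], coeff * g1)).reshape(-1, 1)

# First element of Q G per unit g1 (the coefficient that multiplies g1
# on the left side of the stability relation).
q1_per_g1 = float(Q[0] @ gain(1.0).ravel())

print(coeff, q1_per_g1)
```

With the three-figure entries of (6.17), this solve gives a second-gain coefficient of about -0.56 and a first-row coefficient of about 9.69, in line with the text. The third-gain coefficient comes out near 0.08, which differs somewhat from the 0.055 printed in (6.20).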




The constraining relation (6.9) is used to obtain a restriction 
on $g_1$. For this example (6.9) becomes 
$$13.9\,g_1 + 7.56\,g_2 + 0.357\,g_3 \;\geq\; 13.9\,g_1^2 + 2(7.56)\,g_1g_2 + 2(0.357)\,g_1g_3 + 13.6\,g_2^2 + 2(0.789)\,g_2g_3 + 1.08\,g_3^2$$
Substituting the relations (6.20) and reducing this equation yields 
$$9.69\,g_1 \geq 9.69\,g_1^2$$
$$g_1 \leq 1 \quad \text{for} \quad g_1 > 0 \tag{6.21}$$


Thus if $g_1$ is constrained by equation (6.21) for all n, and 
$g_2$, $g_3$ are determined by (6.20), the filter is sure to be stable by 
its construction using the sufficient conditions of the Lyapunov method 
in conjunction with the Hadamard-Gerschgorin Theorem. 
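
The positive semi-definiteness required of M can also be checked directly by forming (6.2) and inspecting its eigenvalues over the allowed range of the first gain. The sketch below assumes H = (1 0 0), takes Q from the rounded values in (6.17), and recomputes the remaining gains from (6.7) instead of using the rounded figures of (6.20), so that the off-diagonal terms of M vanish to machine precision. The function name `M_matrix` and the grid of test values are illustrative.

```python
import numpy as np

Q = np.array([[13.9, 7.56, 0.357],
              [7.56, 13.6, 0.789],
              [0.357, 0.789, 1.08]])
H = np.array([[1.0, 0.0, 0.0]])

# Gains g2, g3 per unit g1 obtained from (6.7).
coeff = np.linalg.solve(Q[1:, 1:], -Q[1:, 0])

def M_matrix(g1):
    """M = QGH + H'G'Q - H'G'QGH, equation (6.2), for a chosen g1."""
    G = np.concatenate(([g1], coeff * g1)).reshape(-1, 1)
    QGH = Q @ G @ H
    return QGH + QGH.T - H.T @ (G.T @ Q @ G) @ H

# Smallest eigenvalue of M over a grid of g1 values in (0, 1].
min_eigs = [np.linalg.eigvalsh(M_matrix(g1)).min()
            for g1 in np.linspace(0.05, 1.0, 20)]

print(min(min_eigs))
```

A smallest eigenvalue that is zero up to rounding error for every grid point is what positive semi-definiteness of M requires.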

The simplest kind of filter to implement is one with constant gains. 
A simple time varying gain which satisfies (6.21) is: 
$$g_1 = \frac{1}{n} \quad \text{for} \quad n = 1, 2, 3, \ldots$$


As discussed in Section VI A, a comparison of these filters with 
the optimal filter can be made by plotting tr(QP(n)) for each filter. 
The covariance of error matrix P(n) was computed on a digital 
computer for the optimal filter and for the following two suboptimal 
gains established by (6.20) and (6.21). 
$$G_1 = \begin{bmatrix} 1 \\ -.56 \\ .055 \end{bmatrix}, \qquad G_2 = \begin{bmatrix} 1/n \\ -.56/n \\ .055/n \end{bmatrix} \tag{6.22}$$
Computer subroutines for these calculations appear in the Computer 
Program Section. The subroutine "GAIN" was used for the optimal filter 
and "SUBCOV" for the filter gains in (6.22). Then using the Lyapunov 
matrix, Q, in (6.17) 
$$E[V] = \mathrm{tr}(QP(n)) \tag{6.23}$$
was plotted on the same axes for all three filters as shown in Figure 4. 

It is interesting to note that all three filters are stable, and 
the optimal filter does converge faster than the suboptimal filters as 
predicted in Chapter V, Section B. The optimal filter has its greatest 
advantage in the first one or two sample periods. That is, the corrected 
covariance of error (P) is changing more rapidly here than for the 
suboptimal filters. 


C. CONCLUSIONS 

The main developments in this thesis show that it is feasible to 
design suboptimal filters which do perform nearly as well as the Kalman 
filter. This is of great advantage because suboptimal filters often 
lead to a much simpler implementation. Such filters are designed so 
that they will be ASIL and converge faster than a known bound. The 
design is accomplished by using Lyapunov functions and requires that the 
signal model be ASIL. 

There are two areas brought out by this thesis which require further 
research. 

1. Find a suitable bounding function for the time derivative of 
the Lyapunov function (4.22 or 4.23) for the optimal filters. 

2. Find bounding relations for the covariance of error matrix, P, 
similar to those developed in Theorem 3.5 for the system state. 




The two most important properties of the filter-observer system are 
its convergence rate and its ability to discriminate against noise. By 
combining any results obtained for 1. and 2. above with the techniques 
developed in Chapter IV of this thesis, a design procedure for 
suboptimal filters could be formulated which would ensure that the filter 
be stable and also give information about its two most important 
properties. Hence, such designs could be formulated and compared with 
other such designs without the necessity of simulating each filter. 
APPENDIX A 
NORMS 

A norm is a scalar function of several variables which satisfies 
the following conditions. The norm is denoted $\|\cdot\|$ 
i) $\|X\| \geq 0$ and $\|X\| = 0$ iff $X = 0$ 

ii) $\|\alpha X\| = |\alpha|\,\|X\|$ where $|\alpha|$ is the magnitude 
of a complex scalar 

iii) $\|X + Y\| \leq \|X\| + \|Y\|$, Minkowski inequality 

Possible norms for an n vector x 
$$\|x\| = \sum_{i=1}^{n}|x_i|, \qquad \|x\| = \sup_{i}|x_i|$$
$$\|x\| = \Big(\sum_{i}|x_i|^2\Big)^{1/2} = (\mathrm{tr}(xx'))^{1/2}$$
$$\|x\| = \Big(\sum_{i}|x_i|^p\Big)^{1/p}, \qquad 1 \leq p < \infty$$
Possible norms for a general matrix A 
$$\|A\| = [\mathrm{tr}(AA')]^{1/2}, \qquad \|A\| = \sum_{i,j}|a_{ij}|$$
$$\|A\| = \max_{i}\sum_{j=1}^{n}|a_{ij}|, \qquad \|A\| = \Big(\sum_{i,j}|a_{ij}|^2\Big)^{1/2}$$
$$\|A\| = \sup_{\|x\|=1}\|Ax\|$$
Other properties: Schwarz inequality 
$$\|AB\| \leq \|A\| \times \|B\|$$

Further details may be found in Ref. K1, S1, T1. 
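
Several of the norms listed above can be evaluated in a few lines, which also gives a quick check of the inequality $\|AB\| \leq \|A\|\,\|B\|$ for the trace norm. The vector and matrices below are arbitrary illustrative values, and the helper name `trace_norm` is an assumption made for this sketch.

```python
import numpy as np

x = np.array([3.0, -4.0, 1.0])                 # illustrative vector
A = np.array([[1.0, -2.0], [0.5, 3.0]])        # illustrative matrices
B = np.array([[2.0, 0.0], [-1.0, 1.0]])

sum_norm = float(np.sum(np.abs(x)))            # sum of |x_i|
sup_norm = float(np.max(np.abs(x)))            # largest |x_i|
euclid_norm = float(np.sqrt(x @ x))            # (tr(x x'))^(1/2)

def trace_norm(M):
    """Matrix norm [tr(M M')]^(1/2)."""
    return float(np.sqrt(np.trace(M @ M.T)))

lhs = trace_norm(A @ B)
rhs = trace_norm(A) * trace_norm(B)

print(sum_norm, sup_norm, euclid_norm, lhs, rhs)
```

The last two printed numbers illustrate the product inequality for one pair of matrices; other norms in the list can be substituted into `trace_norm` in the same way.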




APPENDIX B 
Lyapunov Function for Linear Discrete System 
Davison and Man [D3] have derived an algorithm to numerically 
generate a Lyapunov function for linear time invariant systems 
given a negative definite quadratic form for $\dot{V}$. Here a similar 
algorithm is developed for use with linear discrete systems. 
In section III C it is shown that for the discrete system: 
$$x(n+1) = A_D\,x(n) \tag{1}$$
a positive definite Lyapunov function of the form: 
$$V = x'Qx \tag{2}$$
exists with: 
$$\Delta V = -x'Cx \tag{3}$$
being negative definite. If the system (1) is stable, Q is 
determined for any positive definite C from the relation 
$$-C = A_D'\,Q\,A_D - Q \tag{4}$$
In general it is difficult to solve equation (4) for Q even 
given that C = I, let alone some arbitrary positive definite C. 
So another approach is taken, which assumes that the system described 
by (1) is ASIL. 
Proceeding, we write the Lyapunov function following the 
system trajectory described by (1) as: 
$$V(n) = V(0) + V(1) - V(0) + V(2) - V(1) + \cdots + V(n) - V(n-1)$$
since $\Delta V(k) = V(k+1) - V(k)$, 
then 
$$V(n) = V(0) + \sum_{i=0}^{n-1}\Delta V(i) \tag{5}$$
Now let n approach infinity; then V(n) approaches zero by 
assumption that (1) is ASIL. Then (5) becomes: 
$$V(0) = -\sum_{i=0}^{\infty}\Delta V(i)$$
Using equation (3) 
$$V(0) = \sum_{i=0}^{\infty} x_i'\,C\,x_i \tag{6}$$
Noting that the solution to (1) is $x_i = A_D^{\,i}\,x_0$ and that 
$V(0) = x_0'\,Q\,x_0$, 
we rewrite equation (6): 
$$x_0'\,Q\,x_0 = \sum_{i=0}^{\infty} x_0'\,(A_D')^{i}\,C\,A_D^{\,i}\,x_0$$
which implies that: 
$$Q = \sum_{i=0}^{\infty}(A_D')^{i}\,C\,A_D^{\,i} \tag{7}$$
This summation will converge if equation (1) is stable, since 
$|\lambda(A_D)| < 1$, and will not converge if (1) is unstable. 
The proposed algorithm as stated here is easily implemented 
on any computer system having appropriate matrix subroutines. 
$$D_{k+1} = A_D'\,D_k\,A_D$$
$$Q_{k+1} = Q_k + D_{k+1}$$
with initial conditions: 
$$D_0 = Q_0 = C$$
and termination criterion: 
$$\|D_{k+1}\| < \epsilon$$
The matrix, D, is the correction matrix and a simple 
convergence criterion is to terminate the iteration when the magnitude 
of all elements of the correction matrix are less than some desired 
amount. That is: 
$$\max_{i,j}\,|d_{ij}| < \epsilon \quad \text{where} \quad D = (d_{ij})$$
Subroutine "LYAP", given in the computer program section following, 
was used to generate the Lyapunov Q matrix for C = I in the third 
order example given in chapter VI. The result is given in equation 
(6.17). 
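
The iteration above can be sketched in a few lines of Python. This is not the "LYAP" subroutine itself, only an illustration of the same summation under stated assumptions: the function name `lyap_series`, the iteration cap `max_iter`, and the two-state test matrix `A_d` are hypothetical choices made for this example.

```python
import numpy as np

def lyap_series(A_d, C, eps=5e-4, max_iter=10000):
    """Sum Q = C + A'CA + (A')^2 C A^2 + ... until the correction is small.

    Returns the accumulated Q and the number of iterations used. Stops
    when every element of the correction matrix D is below eps in
    magnitude.
    """
    D = C.copy()      # D_0 = C
    Q = C.copy()      # Q_0 = C
    for k in range(1, max_iter + 1):
        D = A_d.T @ D @ A_d       # next correction matrix
        Q = Q + D                 # accumulate the series
        if np.max(np.abs(D)) < eps:
            return Q, k
    raise RuntimeError("series did not converge; A_d may be unstable")

# Hypothetical stable discrete system (eigenvalues 0.9 and 0.8).
A_d = np.array([[0.9, 0.1], [0.0, 0.8]])
C = np.eye(2)

Q, iters = lyap_series(A_d, C, eps=1e-12)
residual = A_d.T @ Q @ A_d - Q + C     # should be near zero by (4)

print(iters, np.max(np.abs(residual)))
```

The residual check uses relation (4) directly, so it gives an independent confirmation that the truncated series has converged to the required Q.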